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PREFACE 


Unlike Newton's mechanics, or Maxwell’s electrodynamics, or Einstein's relativity, 
quantum theory was not created—or even definitively packaged—by one individ- 
ual, and it retains to this day some of the scars of its exhilarating but traumatic 
youth. There is no general consensus as to what its fundamental principles are, how 
it should be taught, or what it really “means.” Every competent physicist can “do” 
quantum mechanics, but the stories we tell ourselves about what we are doing are 
as various as the tales of Scheherazade, and almost as implausible. Niels Bohr said, 
“If you are not confused by quantum physics then you haven’t really understood 
it”; Richard Feynman remarked, “I think I can safely say that nobody understands 
quantum mechanics.” 

The purpose of this book is to teach you how to do quantum mechanics. Apart 
from some essential background in Chapter 1, the deeper quasi-philosophical ques- 
tions are saved for the end. I do not believe one can intelligently discuss what 
quantum mechanics means until one has a firm sense of what quantum mechan- 
ics does. But if you absolutely cannot wait, by all means read the Afterword 
immediately following Chapter 1. 

Not only is quantum theory conceptually rich, it is also technically difficult, 
and exact solutions to all but the most artificial textbook examples are few and far 
between. It is therefore essential to develop special techniques for attacking more 
realistic problems. Accordingly, this book is divided into two parts;! Part I covers 
the basic theory, and Part II assembles an arsenal of approximation schemes, with 
illustrative applications. Although it is important to keep the two parts /ogically 
separate, it is not necessary to study the material in the order presented here. Some 


lThis structure was inspired by David Park's classie text, Introduction to the Quantum Theory, 
3rd ed.. McGraw-Hill, New York (1992). 
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Preface 


instructors, for example, may wish to treat time-independent perturbation theory 
immediately after Chapter 2. 

This book is intended for a one-semester or one-year course at the junior or 
senior level. A one-semester course will have to concentrate mainly on Part I; 
a full-year course should have room for supplementary material beyond Part II. 
The reader must be familiar with the rudiments of linear algebra (as summarized 
in the Appendix), complex numbers, and calculus up through partial derivatives; 
some acquaintance with Fourier analysis and the Dirac delta function would help. 
Elementary classical mechanics is essential, of course, and a little electrodynamics 
would be useful in places. As always, the more physics and math you know the 
easier it will be, and the more you will get out of your study. But I would like 
to emphasize that quantum mechanics is not, in my view, something that flows 
smoothly and naturally from earlier theories. On the contrary, it represents an 
abrupt and revolutionary departure from classical ideas, calling forth a wholly new 
and radically counterintuitive way of thinking about the world. That, indeed, is 
what makes it such a fascinating subject. 

At first glance, this book may strike you as forbiddingly mathematical. We 
encounter Legendre, Hermite, and Laguerre polynomials, spherical harmonics, 
Bessel, Neumann, and Hankel functions, Airy functions, and even the Riemann 
zeta function—not to mention Fourier transforms, Hilbert spaces, hermitian oper- 
ators, Clebsch-Gordan coefficients, and Lagrange multipliers. Is all this baggage 
really necessary? Perhaps not, but physics is like carpentry: Using the right tool 
makes the job easier, not more difficult, and teaching quantum mechanics without 
the appropriate mathematical equipment is like asking the student to dig a foun- 
dation with a screwdriver. (On the other hand, it can be tedious and diverting if 
the instructor feels obliged to give elaborate lessons on the proper use of each 
tool. My own instinct is to hand the students shovels and tell them to start dig- 
ging. They may develop blisters at first, but I still think this is the most efficient 
and exciting way to learn.) At any rate, I can assure you that there is no deep 
mathematics in this book, and if you run into something unfamiliar, and you don’t 
find my explanation adequate, by all means ask someone about it, or look it up. 
There are many good books on mathematical methods —I particularly recommend 
Mary Boas, Mathematical Methods in the Physical Sciences, 2nd ed., Wiley, New 
York (1983), or George Arfken and Hans-Jurgen Weber, Mathematical Methods for 
Physicists, 5th ed., Academic Press, Orlando (2000). But whatever you do, don't 
let the mathematics — which, for us, is only a tool —interfere with the physics. 

Several readers have noted that there are fewer worked examples in this book 
than is customary, and that some important material is relegated to the problems. 
This is no accident. I don't believe you can learn quantum mechanics without doing 
many exercises for yourself. Instructors should of course go over as many problems 
in class as time allows, but students should be warned that this is not a subject 
about which anyone has natural intuitions—you're developing a whole new set 
of muscles here, and there is simply no substitute for calisthenics. Mark Semon 
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suggested that I offer a "Michelin Guide" to the problems, with varying numbers 
of stars to indicate the level of difficulty and importance. This seemed like a good 
idea (though, like the quality of a restaurant, the significance of a problem is partly 
a matter of taste); I have adopted the following rating scheme: 


* an essential problem that every reader should study; 
**  asomewhat more difficult or more peripheral problem; 
* * * an unusually challenging problem, that may take over an hour. 


(No stars at all means fast food: OK if you're hungry, but not very nourishing.) 
Most of the one-star problems appear at the end of the relevant section; most of 
the three-star problems are at the end of the chapter. A solution manual is available 
(to instructors only) from the publisher. 

In preparing the second edition I have tried to retain as much as possible the 
spirit of the first. The only wholesale change is Chapter 3, which was much too 
long and diverting; it has been completely rewritten, with the background material 
on finite-dimensional vector spaces (a subject with which most students at this level 
are already comfortable) relegated to the Appendix. I have added some examples 
in Chapter 2 (and fixed the awkward definition of raising and lowering operators 
for the harmonic oscillator). In later chapters I have made as few changes as I 
could, even preserving the numbering of problems and equations, where possible. 
The treatment is streamlined in places (a better introduction to angular momentum 
in Chapter 4, for instance, a simpler proof of the adiabatic theorem in Chapter 
10, and a new section on partial wave phase shifts in Chapter 11). Inevitably, the 
second edition is a bit longer than the first, which I regret, but I hope it is cleaner 
and more accessible. 

I have benefited from the comments and advice of many colleagues, who 
read the original manuscript, pointed out weaknesses (or errors) in the first edition, 
suggested improvements in the presentation, and supplied interesting problems. I 
would like to thank in particular P. K. Aravind (Worcester Polytech), Greg Benesh 
(Baylor) David Boness (Seattle), Burt Brody (Bard), Ash Carter (Drew), Edward 
Chang (Massachusetts), Peter Collings (Swarthmore), Richard Crandall (Reed), 
Jeff Dunham (Middlebury), Greg Elliott (Puget Sound), John Essick (Reed), Gregg 
Franklin (Carnegie Mellon), Henry Greenside (Duke), Paul Haines (Dartmouth), 
J. R. Huddle (Navy), Larry Hunter (Amherst), David Kaplan (Washington), Alex 
Kuzmich (Georgia Tech), Peter Leung (Portland State), Tony Liss (Illinois), Jeffry 
Mallow (Chicago Loyola), James McTavish (Liverpool), James Nearing (Miami), 
Johnny Powell (Reed), Krishna Rajagopal (MIT), Brian Raue (Florida Interna- 
tional), Robert Reynolds (Reed), Keith Riles (Michigan), Mark Semon (Bates), 
Herschel Snodgrass (Lewis and Clark), John Taylor (Colorado), Stavros Theodor- 
akis (Cyprus), A. S. Tremsin (Berkeley), Dan Velleman (Amherst), Nicholas 
Wheeler (Reed), Scott Willenbrock (Illinois), William Wootters (Williams), Sam 
Wurzel (Brown), and Jens Zorn (Michigan). 
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PARTI THEORY 


THE WAVE FUNCTION 


1.1 THE SCHRODINGER EQUATION 


Imagine a particle of mass m, constrained to move along the x-axis, subject to 
some specified force F(x.f) (Figure 1.1). The program of classical mechanics 
is to determine the position of the particle at any given time: x(t). Once we 
know that, we can figure out the velocity (v = dx/dt), the momentum (p = 
mv), the kinetic energy (T = (1/2)mv?), or any other dynamical variable of 
interest. And how do we go about determining x(f)? We apply Newton's sec- 
ond law: F = ma. (For conservative systems—the only kind we shall con- 
sider, and, fortunately, the only kind that occur at the microscopic level—the 
force can be expressed as the derivative of a potential energy function! F — 
—8V/dx, and Newton’s law reads m d?x/dt? = —8V/dx.) This, together with 
appropriate initial conditions (typically the position and velocity at t = 0), deter- 
mines x(t). 

Quantum mechanics approaches this same problem quite differently. In this 
case what we're looking for is the particle’s wave function, V(x, 7), and we get 
it by solving the Schródinger equation: 


Vw. [1.1] 


~ 2m 8x? 


! Magnetic forces are an exception. but let's not worry about them just yet. By the way. we shall 
assume throughout this book that the motion is nonrelativistic (v < c). 
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e a — 
x(t) 


FIGURE 1.1: A “particle” constrained to move in one dimension under the influence 
of a specified force. 


Here i is the square root of —1, and fi is Planck's constant—or rather, his original 
constant (A) divided by 27: 


h 
h = — = 1.054572 x 10734J s. [1.2] 
2x 


The Schródinger equation plays a role logically analogous to Newton's second 
law: Given suitable initial conditions (typically, V (x, 0)), the Schrödinger equation 
determines W(x. f) for all future time, just as, in classical mechanics, Newton's 
law determines x(t) for all future time? 


1.2 THE STATISTICAL INTERPRETATION 


But what exactly is this “wave function," and what does it do for you once you've 
got it? After all, a particle, by its nature, is localized at a point, whereas the wave 
function (as its name suggests) is spread out in space (it's a function of x, for any 
given time f). How can such an object represent the state of a particle? The answer 
is provided by Born's statistical interpretation of the wave function, which says 
that |Y (x, 1)|? gives the probability of finding the particle at point x, at time t —or, 
more precisely, 


between a and b, at time f. 


2b T . JAN 
| Ivo. nf dx = | probability of finding the particle | [1.3] 
é 


al 


Probability is the area under the graph of ||”. For the wave function in Figure 1.2, 
you would be quite likely to find the particle in the vicinity of point A, where ||? 
is large, and relatively unlikely to find it near point B. 


?For a delightful first-hand account of the origins of the Schrödinger equation see the article by 
Felix Bloch in Physics Today. December 1976. 

3The wave function itself is complex, but |Y? = w* (where W* is the complex conjugate of 
W) is real and nonnegative—as a probability. of course, must be. 
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FIGURE 1.2: A typical wave function. The shaded area represents the probability of 
finding the particle between a and b. The particle would be relatively likely to be found 
near A, and unlikely to be found near B. 


The statistical interpretation introduces a kind of indeterminacy into quan- 
tum mechanics, for even if you know everything the theory has to tell you about 
the particle (to wit: its wave function), still you cannot predict with certainty the 
outcome of a simple experiment to measure its position—all quantum mechan- 
ics has to offer is statistical information about the possible results. This inde- 
terminacy has been profoundly disturbing to physicists and philosophers alike, 
and it is natural to wonder whether it is a fact of nature. or a defect in the 
theory. 

Suppose I do measure the position of the particle, and I find it to be at point 
C.* Question: Where was the particle just before I made the measurement? There 
are three plausible answers to this question, and they serve to characterize the main 
schools of thought regarding quantum indeterminacy: 

1. The realist position: The particle was at C. This certainly seems like a sen- 
sible response, and it is the one Einstein advocated. Note, however, that if this is 
true then quantum mechanics is an incomplete theory, since the particle really was 
at C, and yet quantum mechanics was unable to tell us so. To the realist, indeter- 
minacy is not a fact of nature, but a reflection of our ignorance. As d'Espagnat put 
it, "the position of the particle was never indeterminate, but was merely unknown 
to the experimenter."? Evidently V is not the whole story —some additional infor- 
mation (known as a hidden variable) is needed to provide a complete description 
of the particle. 

2. The orthodox position: The particle wasn't really anywhere. It was the act 
of measurement that forced the particle to “take a stand” (though how and why it 
decided on the point C we dare not ask). Jordan said it most starkly: “Observations 
not only disturb what is to be measured, they produce it... We compel (the 


4Of course, no measuring instrument is perfectly precise: what I mean is that the particle was 
found in the vicinity of C, to within the tolerance of the equipment. 


?Bernard d Espagnat, "The Quantum Theory and Reality" (Scientific American, November 1979. 
p. 165). 
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particle) to assume a definite position."? This view (the so-called Copenhagen 
interpretation), is associated with Bohr and his followers. Among physicists it 
has always been the most widely accepted position. Note, however, that if it is 
correct there is something very peculiar about the act of measurement—something 
that over half a century of debate has done precious little to illuminate. 

3. The agnostic position: Refuse to answer. This is not quite as silly as it 
sounds —after all, what sense can there be in making assertions about the status 
of a particle before a measurement, when the only way of knowing whether you 
were right is precisely to conduct a measurement, in which case what you get is no 
longer “before the measurement?" It is metaphysics (in the pejorative sense of the 
word) to worry about something that cannot, by its nature, be tested. Pauli said: 
"One should no more rack one's brain about the problem of whether something one 
cannot know anything about exists all the same, than about the ancient question of 
how many angels are able to sit on the point of a needle."? For decades this was the 
"fall-back" position of most physicists: They'd try to sell you the orthodox answer, 
but if you were persistent they'd retreat to the agnostic response, and terminate the 
conversation. 

Until fairly recently, all three positions (realist, orthodox, and agnostic) had 
their partisans. But in 1964 John Bell astonished the physics community by showing 
that it makes an observable difference whether the particle had a precise (though 
unknown) position prior to the measurement, or not. Bell's discovery effectively 
eliminated agnosticism as a viable option, and made it an experimental question 
whether 1 or 2 is the correct choice. I'll return to this story at the end of the book, 
when you will be in a better position to appreciate Bell's argument; for now, suffice 
it to say that the experiments have decisively confirmed the orthodox interpreta- 
tion: A particle simply does not have a precise position prior to measurement, any 
more than the ripples on a pond do; it is the measurement process that insists on 
one particular number, and thereby in a sense creates the specific result, limited 
only by the statistical weighting imposed by the wave function. 

What if I made a second measurement, immediately after the first? Would Í 
get C again, or does the act of measurement cough up some completely new num- 
ber each time? On this question everyone is in agreement: A repeated measurement 
(on the same particle) must return the same value. Indeed, it would be tough to 
prove that the particle was really found at C in the first instance, if this could not 
be confirmed by immediate repetition of the measurement. How does the orthodox 


SQuoted in a lovely article by N. David Mermin, "Is the moon there when nobody looks?" 
(Physics Today. April 1985. p. 38). 


7Quoted by Mermin (footnote 6). p. 40. 


SThis statement is a little too strong: There remain a few theoretical and experimental loopholes, 
some of which I shall discuss in the Afterword. There exist viable nonlocal hidden variable theories 
(notably David Bohm's). and other formulations (such as the many worlds interpretation) that do not 
fit cleanly into any of my three categories. But | think it is wise. at least from a pedagogical point of 
view. to adopt a clear and coherent platform at this stage. and worry about the alternatives later. 
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FIGURE 1.3: Collapse of the wave function: graph of |W|? immediately after a 
measurement has found the particle at point C. 


interpretation account for the fact that the second measurement is bound to yield 
the value C? Evidently the first measurement radically alters the wave function, 
so that it is now sharply peaked about C (Figure 1.3). We say that the wave func- 
tion collapses, upon measurement, to a spike at the point C (it soon spreads out 
again, in accordance with the Schrédinger equation, so the second measurement 
must be made quickly). There are, then, two entirely distinct kinds of physical pro- 
cesses: "ordinary" ones, in which the wave function evolves in a leisurely fashion 
under the Schrödinger equation, and "measurements," in which Y suddenly and 
discontinuously collapses.? 


1.3 PROBABILITY 


1.3.1 Discrete Variables 


Because of the statistical interpretation, probability plays a central role in quantum 
mechanics, so I digress now for a brief discussion of probability theory. It is mainly 
a question of introducing some notation and terminology, and I shall do it in the 
context of a simple example. 

Imagine a room containing fourteen people, whose ages are as follows: 


one person aged 14, 
one person aged 15, 


three people aged 16, 


?The role of measurement in quantum mechanics is so critical and so bizarre that you may 
well be wondering what precisely constitutes a measurement. Does it have to do with the interaction 
between a microscopic (quantum) system and a macroscopic (classical) measuring apparatus (as Bohr 
insisted), or is it characterized by the leaving of a permanent "record" (as Heisenberg claimed), or does 
it involve thc intervention of a conscious "observer" (as Wigner proposed)? I'll return to this thorny 
issue in the Afterword: for the moment let's take the naive view: A measurement is the kind of thing 
that a scientist does in the laboratory. with rulers, stopwatches, Geiger counters, and so on. 
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two people aged 22, 
two people aged 24, 
five people aged 25. 


If we let N(j) represent the number of people of age j, then 


N(14) = 1, 
N(15) = 1, 
N(16) 23, 
N(22) =2, 
N(24) =2, 
N(25) =5, 


while N(17), for instance, is zero. The total number of people in the room is 
oo 
N=) NQ). [1.4] 
j=0 


(In the example, of course, N = 14.) Figure 1.4 is a histogram of the data. The 
following are some questions one might ask about this distribution. 

Question 1. If you selected one individual at random from this group, what 
is the probability that this person's age would be 15? Answer: One chance in 
14, since there are 14 possible choices, all equally likely, of whom only one has 
that particular age. If P(j) is the probability of getting age j, then P(14) — 
1/14, P(15) 2 1/14, P(16) — 3/14, and so on. In general, 


P(j) 2 ——. [1.5] 


NG) 


10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 j 


FIGURE 1.4: Histogram showing the number of people, NỌ), with age j, for the 
distribution in Section 1.3.1. 
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Notice that the probability of getting either 14 or 15 is the. sum of the individual 
probabilities (in this case, 1/7). In particular, the sum of all the probabilities is 
| — you're certain to get some age: 


Pg l. [1.6] 


j=0 


Question 2. What is the most probable age? Answer: 25, obviously; five 
people share this age, whereas at most three have any other age. In general, the 
most probable j is the j for which P(j) is a maximum. 

Question 3. What is the median age? Answer: 23, for 7 people are younger 
than 23, and 7 are older. (In general, the median is that value of j such that the 
probability of getting a larger result is the same as the probability of getting a 
smaller result.) 

Question 4. What is the average (or mean) age? Answer: 


(14) + (15) + 3(16) + 2(22) + 2(24) + 5(25) 294 
14 T j4 


= 21. 


In general, the average value of j (which we shall write thus: (/)) is 


iN 
u = £480 = Dur [1.7] 


Notice that there need not be anyone with the average age or the median age —in 
this example nobody happens to be 21 or 23. In quantum mechanics the average 
is usually the quantity of interest; in that context it has come to be called the 
expectation value. It's a misleading term, since it suggests that this is the outcome 
you would be most likely to get if you made a single measurement (that would 
be the most probable value, not the average value)—but I’m afraid we're stuck 
with it. 

Question 5. What is the average of the squares of the ages? Answer: You 
could get 14? — 196, with probability 1/14, or 15? — 225, with probability 1/14, 
or 16? — 256, with probability 3/14, and so on. The average, then, is 


Ue» J PO). [1.8] 


j=0 


In general, the average value of some function of j is given by 


(FS > IDP). [1.9] 


j=0 
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N(/) N(J) 


1 2 3 4 56788910 j 123 45686789910 j 


FIGURE 1.5: Two histograms with the same median, same average, and same most 
probable value, but different standard deviations. 


(Equations 1.6, 1.7, and 1.8 are, if you like, special cases of this formula.) Beware: 
The average of the squares, (j?), is not equal, in general, to the square of the 
average, (j)?. For instance, if the room contains just two babies, aged 1 and 3, 
then (x?) — 5, but (x)? — 4. 

Now, there is a conspicuous difference between the two histograms in Figure 1.5, 
even though they have the same median, the same average, the sàme most probable 
value, and the same number of elements: The first is sharply peaked about the average 
value, whereas the second is broad and flat. (The first might represent the age profile 
for students in a big-city classroom, the second, perhaps, a rural one-room school- 
house.) We need a numerical measure of the amount of "spread" in a distribution, 
with respect to the average. The most obvious way to do this would be to find out 
how far each individual deviates from the average, 


Aj=i-(). [1.10] 


and compute the average of Aj. Trouble is, of course, that you get zero, since, by 
the nature of the average, Aj is as often negative as positive: 


(Aj) 2 G- UPSA PG- UO PO) 
= (j) — (j) = 0. 
(Note that (j) is constant—it does not change as you go from one member of 
the sample to another—so it can be taken outside the summation.) To avoid this 
irritating problem you might decide to average the absolute value of Aj. But 


absolute values are nasty to work with; instead, we get around the sign problem 
by squaring before averaging: 


o? = ((Aj)’). [1.11] 
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This quantity is known as the variance of the distribution; o itself (the square 
root of the average of the square of the deviation from the average— gulp!) is 
called the standard deviation. The latter is the customary measure of the spread 
about (/). 

There is a useful little theorem on variances: 


o? = (Aj) = $ AD! PG) = Yu - uv? Po) 
= DOU? - 244) + PG) 
= PF POA 20» IPOD FUF 5 POD 
2 


2 


-(0^-20)0) + UY = U^ - gy. 


Taking the square root, the standard deviation itself can be written as 


o = (i?) - UY. [1.12] 


In practice, this is a much faster way to get o: Simply calculate (j?) and (j)?, 


subtract, and take the square root. Incidentally, I warned you a moment ago that 
(j*) is not, in general, equal to (j)?. Since ø? is plainly nonnegative (from its 
definition in Equation 1.11), Equation 1.12 implies that 


GASU. [1.13] 


and the two are equal only when ø = 0, which is to say, for distributions with no 
spread at all (every member having the same value). 


1.3.2 Continuous Variables 


So far, I have assumed that we are dealing with a discrete variable—that is, one 
that can take on only certain isolated values (in the example, j had to be an 
integer, since I gave ages only in years). But it is simple enough to generalize to 
continuous distributions. If I select a random person off the street, the probability 
that her age is precisely 16 years, 4 hours, 27 minutes, and 3.333 ... seconds is 
zero. The only sensible thing to speak about is the probability that her age lies in 
some interval —say, between 16 and 17. If the interval is sufficiently short, this 
probability is proportional to the length of the interval. For example, the chance that 
her age is between 16 and 16 plus two days is presumably twice the probability 
that it is between 16 and 16 plus one day. (Unless, I suppose, there was some 
extraordinary baby boom 16 years ago, on exactly that day—in which case we 
have simply chosen an interval] too long for the rule to apply. If the baby boom 
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lasted six hours, we’ll take intervals of a second or less, to be on the safe side. 
Technically, we're talking about infinitesimal intervals.) Thus 


probability that an individual (chosen 
at random) lies between x and (x + dx) 


| = p(x)dx. [1.14] 


The proportionality factor, p(x), is often loosely called “the probability of getting 
x,” but this is sloppy language; a better term is probability density. The probability 
that x lies between a and b (a finite interval) is given by the integral of p(x): 


b 
Pab = | p(x) dx. [1.15] 
a 
and the rules we deduced for discrete distributions translate in the obvious way: 
+00 
ie p) dx, [1.16] 
-0C 
Too 
(x) =f xp(x) dx, [1.17] 
—o6 
+00 
(roy f Fd, [1.18] 
—O00 
o? = (Ax) = (x7) - (x). [1.19] 


Example 1.1 Suppose I drop a rock off a cliff of height /i. As it falls. T snap a 
million photographs, at random intervals. On each picture I measure the distance 
the rock has fallen. Question: What is the average of all these distances? That ts 
to say, what is the time average of the distance traveled?!” 


Solution: The rock starts out at rest, and picks up speed as it falls; it spends more 
time near the top, so the average distance must be less than A/2. Ignoring air 
resistance, the distance x at time t is 


l 5 
x(t) 2 -gt*. 
x (f) zE 


The velocity is dx/dt = gt, and the total flight time is T = ./2h/g. The probability 
that the camera flashes in the interval dt is dt/T, so the probability that a given 


IOA statistician will complain that I am confusing the average of a finite sample (a million, in 
this case) with the "truc" average (over the whole continuum). This can be an awkward problem for 
the experimentalist, especially when the sample size is small, but here I am only concerned. of course, 
with the true average. to which the sample average is presumably a good approximation. 
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h x 


FIGURE 1.6: The probability density in Example 1.1: p(x) = 1/(24/ bx). 
photograph shows a distance in the corresponding range dx is 


dt dx jg 1d 


T gtV2h 2/hx 


dx. 
Evidently the probability density (Equation 1.14) is 


(0c x <h) 


| 
p(x) = Ts 


(outside this range, of course, the probability density is zero). 
We can check this result, using Equation 1.16: 


P 4 l f 
Í IU Tm (2x7) S 
The average distance (Equation 1.17) is 
"NM WP 7 M. 
n [rasta n ns 


which is somewhat less than //2, as anticipated. 

Figure 1.6 shows the graph of p(x). Notice that a probability density can 
be infinite, though probability itself (the integral of p) must of course be finite 
(indeed, less than or equal to 1). 
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*Problem 1.1 For the distribution of ages in Section 1.3.1: 


(a) Compute (j?) and (j)?. 


(b) Determine Aj for each j, and use Equation 1.11 to compute the standard 
deviation. 


(c) Use your results in (a) and (b) to check Equation 1.12. 


Problem 1.2 
(a) Find the standard deviation of the distribution in Example 1.1. 


(b) What is the probability that a photograph, selected at random, would show a 
distance x more than one standard deviation away from the average? 


xProblem 1.3 Consider the gaussian distribution 


p(x) = Ae 9-9. 
where A, a, and A are positive real constants. (Look up any integrals you need.) 


(a) Use Equation 1.16 to determine A. 
(b) Find (x), (x*), and o. 
(c) Sketch the graph of p(x). 


1.4 NORMALIZATION 


We return now to the statistical interpretation of the wave function (Equation 1.3), 
which says that |W(x, t)|* is the probability density for finding the particle at point 
x, at time 7. It follows (Equation 1.16) that the integral of |Y]? must be 1 (the 
particle's got to be somewhere): 


+o 
| W(x, np dx — 1. [1.20] 


—O0 


Without this, the statistical interpretation would be nonsense. 

However, this requirement should disturb you: After all, the wave function is 
supposed to be determined by the Schródinger equation— we can't go imposing 
an extraneous condition on V without checking that the two are consistent. Well, a 
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glance at Equation 1.1 reveals that if W(x, t) is a solution, so too is AV (x. 1), where 
A is any (complex) constant. What we must do, then, is pick this undetermined 
multiplicative factor so as to ensure that Equation 1.20 is satisfied. This process 
is called normalizing the wave function. For some solutions to the Schródinger 
equation the integral is infinite; in that case no multiplicative factor is going to 
make it 1. The same goes for the trivial solution V = 0. Such non-normalizable 
solutions cannot represent particles, and must be rejected. Physically realizable 
states correspond to the square-integrable solutions to Schródinger's equation.!! 

But wait a minute! Suppose I have normalized the wave function at time t = 0. 
How do I know that it will stay normalized, as time goes on, and V evolves? (You 
can't keep renormalizing the wave function, for then A becomes a function of f, 
and you no longer have a solution to the Schródinger equation.) Fortunately, the 
Schródinger equation has the remarkable property that it automatically preserves the 
normalization of the wave function— without this crucial feature the Schródinger 
equation would be incompatible with the statistical interpretation, and the whole 
theory would crumble. 

This is important, so we'd better pause for a careful proof. To begin with, 


d +% +œ 9 5 
zf wa nPar = f — |Y (x, t)|* dx. [1.21] 
dt Joss cos ôt 


(Note that the integral is a function only of t, so I use a total derivative (d/dt) 
in the first expression, but the integrand is a function of x as well as t, so it's a 
partial derivative (8/8t) in the second one.) By the product rule, 


T NA 
ðt Ot 


ð 2 ð 
—|v|-2-—(v*v)-w 1.22 
rhe ess a. [1.22] 
Now the Schrédinger equation says that 


9v ih i 
— = ——- —--VyŅ, 1.23 
0t 2m 0x? hh [ | 


and hence also (taking the complex conjugate of Equation 1.23) 


aw* — iho^w* i 


= -— —— do 1.24 
ðt 2m ðx? ur di 
SO 
8 ih a2  9?w* 8 [ih ow ov* 
iS e t———————p]e-—IlIe[wyt*-— wy). [1.25 
ar | a 0x? 0x? ) Ox E ( Ox Ox ) [1.23] 


!! Evidently V (x. 1?) must go to zero faster than 1/y]x], as |x| — oo. Incidentally. normalization 
only fixes the modulus of A: the phase remains undetermined. However. as we shall see, the latter 
carries no physical significance anyway. 
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The integral in Equation 1.21 can now be evaluated explicitly: 


d rt 5 ih Qv  ov* +90 
E va DPdr = > (P -y 1.26 
dt L ia 2m ( Ox ax ) —00 [1.26] 


But V (x. t) must go to zero as x goes to (+) infinity otherwise the wave function 
would not be normalizable.!? It follows that 


+00 s 

— V (x, f)|* dx = 0, 1.27 
des IV (x, t)| [1.27] 
and hence that the integral is constant (independent of time); if V is normalized 
at f = 0, it stays normalized for all future time. QED 


Problem 1.4 At time t = 0 a particle is represented by the wave function 
Á-. ifüzx-a, 
a 


V(x,0) = aem 
(b—a) 
0, otherwise, 


ifa<x <b, 


where A, a, and b are constants. 


— 


(a) Normalize V (that is, find A, in terms of a and b). 
(b 


(c 


(d 


— 


Sketch V (x, 0), as a function of x. 


~~ 


Where is the particle most likely to be found, at t = 0? 


— 


What is the probability of finding the particle to the left of a? Check your 
result in the limiting cases b = a and b = 2a. 


~~ 


(e) What is the expectation value of x? 


*Problem 1.5 Consider the wave function 


W(x, t) LI Ae ^g iot, 


where A, A, and w are positive real constants. (We'll see in Chapter 2 what potential 
(V) actually produces such a wave function.) 


(a) Normalize V. 
(b) Determine the expectation values of x and x*. 


12A good mathematician can supply you with pathological counterexamples, but they do not arise 
in physics; for us the wave function always goes to zero at infinity. 
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(c) Find the standard deviation of x. Sketch the graph of |W|?, as a function 
of x, and mark the points ((x) +o) and ((x) — o), to illustrate the sense in 
which o represents the “spread” in x. What is the probability that the particle 
would be found outside this range? 


1.5 MOMENTUM 


For a particle in state V, the expectation value of x is 


+00 
(x) = f x|W(x. |^ dx. [1.28] 


|—o0 


What exactly does this mean? It emphatically does not mean that if you measure 
the position of one particle over and over again, f x|W|?dx is the average of the 
results you'll get. On the contrary: The first measurement (whose outcome is inde- 
terminate) will collapse the wave function to a spike at the value actually obtained, 
and the subsequent measurements (if they're performed quickly) will simply repeat 
that same result. Rather, (x) is the average of measurements performed on particles 
all in the state Y, which means that either you must find some way of returning the 
particle to its original state after each measurement, or else you have to prepare a 
whole ensemble of particles, each in the same state V, and measure the positions of 
all of them: (x) is the average of these results. (I like to picture a row of bottles on 
a shelf, each containing a particle in the state V (relative to the center of the bottle). 
A graduate student with a ruler is assigned to each bottle, and at a signal they all 
measure the positions of their respective particles. We then construct a histogram 
of the results, which should match |W|?, and compute the average, which should 
agree with (x). (Of course, since we're only using a finite sample, we can't expect 
perfect agreement, but the more bottles we use, the closer we ought to come.)) In 
short, the expectation value is the average of repeated measurements on an ensem- 
ble of identically prepared systems, not the average of repeated measurements on 
one and the same system. 

Now, as time goes on, (x) will change (because of the time dependence 
of V), and we might be interested in knowing how fast it moves. Referring to 
Equations 1.25 and 1.28, we see that? 


d (x) | ð 2 ih ð ( əy aw* 
— = | x—IWi^dx = — | x— | y*— — V | dx. 1.29 
dt : 5r ids 2m Er Ox Ox : l 


13T keep things from getting too cluttered. I'll suppress the limits of integration. 
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This expression can be simplified using integration-by-parts:'4 


[1.30] 


d(x) th Qv gw* 
— = —-— y*— — V j] dx. 
dt 2m ( Ox Ox i 


(I used the fact that 3x/ðx = 1, and threw away the boundary term, on the ground 
that V goes to zero at (+) infinity.) Performing another integration by parts, on 
the second term, we conclude: 


d(x) . dh Qv 
—— = y* — dx. 1.31 
dt m Ox | 


What are we to make of this result? Note that we're talking about the “veloc- 
ity" of the expectation value of x, which is not the same thing as the velocity of 
the particle. Nothing we have seen so far would enable us to calculate the velocity 
of a particle. It's not even clear what velocity means in quantum mechanics: If the 
particle doesn't have a determinate position (prior to measurement), neither does it 
have a well-defined velocity. All we could reasonably ask for is the probability of 
getting a particular value. We'll see in Chapter 3 how to construct the probability 
density for velocity, given V; for our present purposes it will suffice to postu- 
late that the expectation value of the velocity is equal to the time derivative of the 


expectation value of position: 
d (x) 
= —. 1.32 
pe [1.32] 


Equation 1.31 tells us, then, how to calculate (v) directly from WV. 
Actually, it is customary to work with momentum (p = mv), rather than 
velocity: 


[1.33] 


!HThe product rule says that 


df 
ET )- E n = + a 
from which it follows that 
b d b d 
$ f „b 
— dx= -| —egedx : 
[ re aeta 


Under the integral sign. then, you can peel a derivative off one factor in a product, and slap it onto the 
other one—it'll cost you a minus sign. and you'll pick up a boundary term. 
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Let me write the expressions for (x) and (p) in a more suggestive way: 


(x) = J V* (x) dx. [1.34] 
(p) = J y* (Fs) V dx. [1.35] 
i Ox 


We say that the operator? x “represents” position, and the operator (/1/ i) (8/8x) 
"represents" momentum, in quantum mechanics; to calculate expectation values we 
"sandwich" the appropriate operator between Y* and W, and integrate. 

That's cute, but what about other quantities? The fact is, all classical dynami- 
cal variables can be expressed in terms of position and momentum. Kinetic energy, 
for example, is 


and angular momentum is 
L=rxmv=rxp 


(the latter, of course, does not occur for motion in one dimension). To calculate 
the expectation value of any such quantity, Q(x. p), we simply replace every p 
by (h/i)(8/8x), insert the resulting operator between Y* and V, and integrate: 


(Q(x. p)) = J y*Q (x r2) W dx. [1.36] 


For example, the expectation value of the kinetic energy is 


(T) =-— | ¥* dx. [1.37] 
2 x 


Equation 1.36 is a recipe for computing the expectation value of any dynamical 
quantity, for a particle in state V; it subsumes Equations 1.34 and 1.35 as special 
cases. I have tried in this section to make Equation 1.36 seem plausible, given 
Born's statistical interpretation, but the truth is that this represents such a radically 
new way of doing business (as compared with classical mechanics) that it's a good 
idea to get some practice using it before we come back (in Chapter 3) and put it 
on a firmer theoretical foundation. In the meantime, if you prefer to think of it as 
an axiom, that's fine with me. 


I5 An "operator" is an instruction to do something to the function that follows it. The position 
operator tells you to multiply by x: the momentum operator tells you to differentiate with respect to 
x (and multiply the result by —i/). In this book all operators will be derivatives (d / dt, d? fat. 
a? face . p ; 5 . . 

ð- /dxày. etc.) or multipliers (2. i. x7. etc.). or combinations of these. 
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Problem 1.6 Why can’t you do integration-by-parts directly on the middle expres- 
sion in Equation 1.29—pull the time derivative over onto x, note that dx/dt = 0, 
and conclude that d(x)/dt = 0? 


xProblem 1.7 Calculate d(p)/dt. Answer: 


AB = Cx) [1.38] 


ax 


dt 


Equations 1.32 (or the first part of 1.33) and 1.38 are instances of Ehrenfest’s 
theorem, which tells us that expectation values obey classical laws. 


Problem 1.8 Suppose you add a constant Vo to the potential energy (by "constant" 
I mean independent of x as well as 7). In classical mechanics this doesn't change 
anything, but what about quantum mechanics? Show that the wave function picks 
up a time-dependent phase factor: exp(—i Vot/h). What effect does this have on 
the expectation value of a dynamical variable? 


1.6 THE UNCERTAINTY PRINCIPLE 


Imagine that you're holding one end of a very long rope, and you generate a 
wave by shaking it up and down rhythmically (Figure 1.7). If someone asked you 
"Precisely where is that wave?" you'd probably think he was a little bit nutty: The 
wave isn't precisely any where—-it's spread out over 50 feet or so. On the other 
hand, if he asked you what its wavelength is, you could give him a reasonable 
answer: It looks like about 6 feet. By contrast, if you gave the rope a sudden jerk 
(Figure 1.8), you'd get a relatively narrow bump traveling down the line. This time 
the first question (Where precisely is the wave?) is a sensible one, and the second 
(What is its wavelength?) seems nutty—it isn't even vaguely periodic, so how 
can you assign a wavelength to it? Of course, you can draw intermediate cases, in 
which the wave is fairly well localized and the wavelength is fairly well defined, 
but there is an inescapable trade-off here: The more precise a wave's position is, 
the less precise is its wavelength, and vice versa.!Ó A theorem in Fourier analysis 
makes all this rigorous, but for the moment I am only concerned with the qualitative 
argument. 


l6That's why a piccolo player must be right on pitch. whereas a double-bass player can afford to 
wear garden gloves. For the piccolo. a sixty-fourth note contains many full cycles. and the frequency 
(we're working in the time domain now, instead of space) is well defined. whereas for the bass. at a 
much lower register. the sixty-fourth note contains only a few cycles, and all you hear is a general sort 
of “oomph,” with no very clear pitch. 
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10 20 30 40 50 x (feet) 


FIGURE 1.7: A wave with a (fairly) well-defined wavelength, but an ill-defined 
position. 


go ee 
7 10 20 30 40 50 x (feet) 


FIGURE 1.8: A wave with a (fairly) well-defined position, but an ill-defined wave- 
length. 


This applies, of course, to any wave phenomenon, and hence in particular to 
the quantum mechanical wave function. Now the wavelength of Y is related to the 


momentum of the particle by the de Broglie formula:!? 
} 2n li 
= : = Z, [1.39] 


Thus a spread in wavelength corresponds to a spread in momentum, and our general 
observation now says that the more precisely determined a particle's position is, 
the less precisely is its momentum. Quantitatively, 


[1.40] 


where o, is the standard deviation in x, and øp is the standard deviation in p. 
This is Heisenberg's famous uncertainty principle. (We'll prove it in Chapter 3, 
but I wanted to mention it right away, so you can test it out on the examples in 
Chapter 2.) 

Please understand what the uncertainty principle means: Like position mea- 
surements, momentum measurements yield precise answers—the "spread" here 
refers to the fact that measurements on identically prepared systems do not yield 
identical results. You can, if you want, construct a state such that repeated posi- 
tion measurements will be very close together (by making Y a localized "spike"), 
but you will pay a price: Momentum measurements on this state will be widely 
scattered. Or you can prepare a state with a reproducible momentum (by making 


Up prove this in due course. Many authors take the de Broglie formula as an axiom, from 
which they then deduce the association of momentum with the operator (//7)(8/8x). Although this is 
a conceptually cleaner approach, it involves diverting mathematical complications that I would rather 
save for later. 
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W a long sinusoidal wave), but in that case, position measurements will be widely 
scattered. And, of course, if you’re in a really bad mood you can create a state for 
which neither position nor momentum is well defined: Equation 1.40 is an inequal- 
ity, and there's no limit on how big oy and op can be—just make V some long 
wiggly line with lots of bumps and potholes and no periodic structure. 


xProblem 1.9 A particle of mass mn is in the state 
V(x.t)— Ae an /h)ir]. 


where A and a are positive real constants. 


(a) Find A. 


(b) For what potential energy function V(x) does W satisfy the Schrödinger 
equation? 


(c) Calculate the expectation values of x, x?, p, and p^. 


(d) Find c, and €. Is their product consistent with the uncertainty principle? 


FURTHER PROBLEMS FOR CHAPTER 1 


Problem 1.10 Consider the first 25 digits in the decimal expansion of x (3, 1, 4, 
1, 5,9,...). 


(a) If you selected one number at random, from this set, what are the probabilities 
of getting each of the 10 digits? 


(b) What is the most probable digit? What is the median digit? What is the 
average value? 


(c) Find the standard deviation for this distribution. 


Problem 1.11 The needle on a broken car speedometer is free to swing, and 
bounces perfectly off the pins at either end, so that if you give it a flick it is 
equally likely to come to rest at any angle between 0 and z. 


(a) What is the probability density, o(0)? Hint: p(0) dO is the probability that 
the needle will come to rest between 0 and (0 -- d0). Graph p (0) as a function 
of 0, from —7:/2 to 32/2. (Of course, part of this interval is excluded, so p 
is zero there.) Make sure that the total probability is 1. 
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(b) Compute (0), (02), and c, for this distribution. 


(c) Compute (sin), (cos 0), and (cos? 0). 


Problem 1.12 We consider the same device as the previous problem, but this time 
we are interested in the x-coordinate of the needle point—that is, the “shadow,” 
or "projection," of the needle on the horizontal line. 


(a) What is the probability density o(x)? Graph p(x) as a function of x, from 
—2r to +2r, where r is the length of the needle. Make sure the total prob- 
ability is 1. Hint: p(x) dx is the probability that the projection lies between 
x and (x + dx). You know (from Problem 1.11) the probability that 0 is in 
a given range; the question is, what interval dx corresponds to the inter- 
val d0? 


(b 


Compute (x), (x*), and ø, for this distribution. Explain how you could have 
obtained these results from part (c) of Problem 1.11. 


~x 


x xProblem 1.13 Buffon's needle. A needle of length / is dropped at random onto a 
sheet of paper ruled with parallel lines a distance / apart. What is the probability 
that the needle will cross a line? Hint: Refer to Problem 1.12. 


Problem 1.14 Let P,,(t) be the probability of finding a particle in the range 
(a « x « b), at time f. 


(a) Show that 


d Pab 
dt 


= J(a.t) — J(b.t). 


where 


jer) ih Y gy * prot 
x.f)2 — — — Es 
2m Ox Ox 


What are the units of J (x. f)? Comment: J is called the probability current, 
because it tells you the rate at which probability is "flowing" past the point 
x. If P4) (t) is increasing, then more probability is flowing into the region at 
one end than flows out at the other. 


€ 


Find the probability current for the wave function in Problem 1.9. (This is 
not a very pithy example, I’m afraid; we'll encounter more substantial ones 
in due course.) 
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x xProblem 1.15 Suppose you wanted to describe an unstable particle, that spon- 
taneously disintegrates with a "lifetime" +. In that case the total probability of 
finding the particle somewhere should not be constant, but should decrease at 
(say) an exponential rate: 


Tco 
pas f W(x, D dx 2e. 


—O0c 


A crude way of achieving this result is as follows. In Equation 1.24 we tacitly 
assumed that V (the potential energy) is real. That is certainly reasonable, but it 
leads to the "conservation of probability" enshrined in Equation 1.27. What if we 
assign to V an imaginary part: 


V=V)- iT, 


where Vo is the true potential energy and I is a positive real constant? 


(a) 


(b) 


Show that (in place of Equation 1.27) we now get 
dP 2r 
dt h 


Solve for P (t), and find the lifetime of the particle in terms of T. 


Problem 1.16 Show that 


du ud 


for any two (normalizable) solutions to the Schrödinger equation, V; and Y2. 


Problem 1.17 A particle is represented (at time t = 0) by the wave function 


V(x,0) = otherwise. 


A(a? —x?). if -a <x € +a. 
0, 

Determine the normalization constant A. 

What is the expectation value of x (at time ; = 0)? 


What is the expectation value of p (at time t = 0)? (Note that you cannot 
get it from p — md(x)/dt. Why not?) 


Find the expectation value of x?. 
Find the expectation value of p?. 


Find the uncertainty in x (ox). 
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(g) Find the uncertainty in p (op). 


(h) Check that your results are consistent with the uncertainty principle. 


Problem 1.18 In general, quantum mechanics is relevant when the de Broglie 
wavelength of the particle in question (h/p) is greater than the characteristic size 
of the system (d). In thermal equilibrium at (Kelvin) temperature 7, the average 
kinetic energy of a particle is 


P gr 
2m 2 
(where kg is Boltzmann's constant), so the typical de Broglie wavelength is 
m [1.41] 
v3mkgT 


The purpose of this problem is to anticipate which systems will have to be treated 
quantum mechanically, and which can safely be described classically. 


(a) Solids. The lattice spacing in a typical solid is around d — 0.3 nm. Find the 
temperature below which the free! electrons in a solid are quantum mechan- 
ical. Below what temperature are the nuclei in a solid quantum mechanical? 
(Use sodium as a typical case.) Moral: The free electrons in a solid are 
always quantum mechanical; the nuclei are almost never quantum mechani- 
cal. The same goes for liquids (for which the interatomic spacing is roughly 
the same), with the exception of helium below 4 K. 


(b) Gases. For what temperatures are the atoms in an ideal gas at pressure P 
quantum mechanical? Hint: Use the ideal gas law (PV — NkgT) to deduce 
the interatomic spacing. Answer: T < (1/kg)(1?/3m)?? P2, Obviously 
(for the gas to show quantum behavior) we want m to be as small as possible. 
and P as large as possible. Put in the numbers for helium at atmospheric 
pressure. Is hydrogen in outer space (where the interatomic spacing is about 
] cm and the temperature is 3 K) quantum mechanical? 


'8In a solid the inner electrons are attached to a particular nucleus. and for them the relevant 
size would be the radius of the atom. But the outermost electrons are not attached. and for them the 
relevant distance is the lattice spacing. This problem pertains to the outer electrons. 


CHAPTER 2 


TIME-INDEPENDENT 
SCHRODINGER EQUATION 


2.1 STATIONARY STATES 


In Chapter 1 we talked a lot about the wave function, and how you use it to 
calculate various quantities of interest. The time has come to stop procrastinating, 
and confront what is, logically, the prior question: How do you get V(x, f) in the 
first place? We need to solve the Schródinger equation, 


aw n? 32y 


= —— —- + VY, 2.1 
ðt 2m ðx? t [2.1] 


for a specified potential! V (x. 1). In this chapter (and most of this book) I shall 
assume that V is independent of t. In that case the Schródinger equation can be 
solved by the method of separation of variables (the physicist's first line of attack 
on any partial differential equation): We look for solutions that are simple products, 


V(x.f) = W(x) g(t). [2.2] 


where y (lower-case) is a function of x alone, and q is a function of t alone. On 
its face, this is an absurd restriction, and we cannot hope to get more than a tiny 


lIt is tiresome to keep saying "potential energy function." so most people just call V the 
"potential." even though this invites occasional confusion with electric potential. which is actually 
potential energy per unit charge. 
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subset of all solutions in this way. But hang on, because the solutions we do obtain 
turn out to be of great interest. Moreover (as is typically the case with separation 
of variables) we will be able at the end to patch together the separable solutions 
in such a way as to construct the most general solution. 
For separable solutions we have 
aw dp Y dy 


a ar óx ^ dx?” 
(ordinary derivatives, now), and the Schródinger equation reads 


do h? dy 
iy — = —— ——9 + V yo. 
uy dt imd” TM 


Or, dividing through by vq: 


d dt mw dx? oy [2.3] 
Now, the left side is a function of t alone, and the right side is a function of 
x alone.* The only way this can possibly be true is if both sides are in fact 
constant — otherwise, by varying t, I could change the left side without touching 
the right side, and the two would no longer be equal. (That’s a subtle but crucial 
argument, so if it's new to you, be sure to pause and think it through.) For reasons 
that will appear in a moment, we shall call the separation constant E. Then 


1 do 
h——— =E, 
o dt 
Or 
do iE 
— = —— 9, 2.4 
dt p^ [2-4] 
and 
or 


[2.5] 


Separation of variables has turned a partial differential equation into two ordi- 
nary differential equations (Equations 2.4 and 2.5). The first of these (Equation 2.4) 


2Note that this would nor be true if V were a function of as well as x. 
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is easy to solve (just multiply through by df and integrate); the general solution is 
C exp(—i Et /h), but we might as well absorb the constant C into y (since the quantity 
of interest is the product yg). Then 


g(t) = e FEA [2.6] 


The second (Equation 2.5) is called the time-independent Schrödinger equation; 
we can go no further with it until the potential V(x) is specified. 

The rest of this chapter will be devoted to solving the time-independent 
Schrédinger equation, for a variety of simple potentials. But before I get to 
that you have every right to ask: What’s so great about separable solutions? 
After all, most solutions to the (time dependent) Schrédinger equation do not 
take the form v (x)g(t). I offer three answers—two of them physical, and one 
mathematical: 

1. They are stationary states. Although the wave function itself, 


Vie t) = peT E, [2.7] 
does (obviously) depend on t, the probability density, 
Irc, DP = why = ytet Eye iE = rol? [2.8] 


does nor —the time-dependence cancels out.? The same thing happens in calculat- 
ing the expectation value of any dynamical variable; Equation 1.36 reduces to 


h d 
(Q(x, p) = J vO (x vx) y dx. [2.9] 
i dx 


Every expectation value is constant in time; we might as well drop the factor g(t) 
altogether, and simply use y in place of V. (Indeed, it is common to refer to y as 
"the wave function," but this is sloppy language that can be dangerous, and it is 
important to remember that the true wave function always carries that exponential 
time-dependent factor.) In particular, (x) is constant, and hence (Equation 1.33) 
(p) — 0. Nothing ever happens in a stationary state. 

2. They are states of definite total energy. In classical mechanics, the total 
energy (kinetic plus potential) is called the Hamiltonian: 


pt 


BRR 2m 


+ V(x). [2.10] 


3For normalizable solutions, E must be real (see Problem 2.1(a)). 
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The corresponding Hamiltonian operator, obtained by the canonical substitution 
p — (h/i)(8/8x), is therefore* 


2 92 
^ 


h 
H-————-TVQ) 2.11 
Om Oxi 99 [2.11] 


Thus the time-independent Schrödinger equation (Equation 2.5) can be written 
Hy = Ey, [2.12] 


and the expectation value of the total energy is 


(H) = | y'Ĥydx=E [fasc E f i¥?ax =E. [2.13] 
(Notice that the normalization of V entails the normalization of y.) Moreover, 
H?y = B(Hy) = É(Ey) = E(AW) = E?y, 
and hence 
(H?) = [vues = e f Iy? dx = E?. 
So the variance of H is 
oz; = (H?) — (HY = E’ — E? =0. [2.14] 


But remember, if o = 0, then every member of the sample must share the same 
value (the distribution has zero spread). Conclusion: A separable solution has the 
property that every measurement of the total energy is certain to return the value 
E. (That's why I chose that letter for the separation constant.) 

3. The general solution is a linear combination of separable solutions. As 
we're about to discover, the time-independent Schródinger equation (Equation 2.5) 
yields an infinite collection of solutions (W(x), W2(x), Va(x)....), each with 
its associated value of the separation constant (Ej, E2, E3,...); thus there is a 
different wave function for each allowed energy: 


Vi (x, 1) 2 Vi G)e et", dex, n) = pae, .... 
Now (as you can easily check for yourself) the (time-dependent) Schródinger 


equation (Equation 2.1) has the property that any linear combination? of solutions 


* Whenever confusion might arise. I'll put a “hat” (^) on the operator, to distinguish it from the 
dynamical variable it represents. 


? A linear combination of the functions /1(z). f2(z). ... is an expression of the form 
Fosa eaf). 


where c4. 2. ... are any (complex) constants. 
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is itself a solution. Once we have found the separable solutions, then, we can 
immediately construct a much more general solution, of the form 


oo 
Vx. t) = do enr Ge Ph, [2.15] 


n=l 


It so happens that every solution to the (time-dependent) Schródinger equation 
can be written in this form—it is simply a matter of finding the right constants 
(ci, C2, ...) so as to fit the initial conditions for the problem at hand. You'll see 
in the following sections how all this works out in practice, and in Chapter 3 we'll 
put it into more elegant language, but the main point is this: Once you've solved 
the time-in dependent Schródinger equation, you're essentially done; getting from 
there to the general solution of the time-dependent Schrödinger equation is, in 
principle, simple and straightforward. 

A lot has happened in the last four pages, so let me recapitulate, from a 
somewhat different perspective. Here's the generic problem: You're given a (time- 
independent) potential V (x), and the starting wave function V(x, 0); your job is 
to find the wave function, V (x, f), for any subsequent time f. To do this you must 
solve the (time-dependent) Schrödinger equation (Equation 2.1). The strategy? is 
first to solve the time-independent Schródinger equation (Equation 2.5); this yields, 
in general, an infinite set of solutions (Vr (x), W2(x), Va(x). ...), each with its own 
associated energy (Ej, E», E5,...). To fit Y(x,0) you write down the general 
linear combination of these solutions: 


oo 
V(x,0)— 25 Cn Vu (X): [2.16] 


n=] 


the miracle is that you can always match the specified initial state by appropriate 
choice of the constants cj, c2, c3, ... . To construct V(x,f) you simply tack onto 
each term its characteristic time dependence, exp(—i E, /ħ): 


oo oo 
VaN = 3 jen Qe tll” = oi Qr 1). [2.17] 


n=l n=l 


The separable solutions themselves, 


Wy (x,t) = Yre Eh, [2.18] 


Occasionally you can solve the time-dependent Schrödinger equation without recourse to sep- 
aration of variables— see. for instance. Problems 2.49 and 2.50. But such cases are extremely rare. 
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are Stationary states, in the sense that all probabilities and expectation values are 
independent of time, but this property is emphatically not shared by the general 
solution (Equation 2.17); the energies are different, for different stationary states, 
and the exponentials do not cancel, when you calculate |Y }?. 


Example 2.1 Suppose a particle starts out in a linear combination of just two 
stationary states: 


V (x. 0) = cr Wi (x) + e2ya(x). 


(To keep things simple I'll assume that the constants c, and the states w,,(.x) are 
real.) What is the wave function V (x. f) at subsequent times? Find the probability 
density, and describe its motion. 


Solution: The first part is easy: 
Pex, t) = eii Ge I PI + eye Get PII, 
where E; and E» are the energies associated with yı and wo. It follows that 


IW N = (ere PI + cone E yepe iE + cope HE") 


= eit + e$U$ + 2ci caria cos[(E2 — Ep /h]. 


(I used Euler’s formula, exp i0 = cos0 +i sin, to simplify the result.) Evidently 
the probability density oscillates sinusoidally, at an angular frequency (E? — E1)/ħ; 
this is certainly not a stationary state. But notice that it took a /inear combination 
of states (with different energies) to produce motion." 


xProblem 2.1 Prove the following three theorems: 


(a) For normalizable solutions, the separation constant E must be real. Hint: 
Write E (in Equation 2.7) as Eo + iT (with Ep and T real), and show that 
if Equation 1.20 is to hold for all t, T must be zero. 


(b) The time-independent wave function w(x) can always be taken to be real 
(unlike V (x. f), which is necessarily complex). This doesn't mean that every 
solution to the time-independent Schródinger equation is real; what it says 
is that if you've got one that is not, it can always be expressed as a linear 
combination of solutions (with the same energy) that are. So you might as 
well stick to y/'s that are real. Hint: If w(x) satisfies Equation 2.5, for a 
given E, so too does its complex conjugate, and hence also the real linear 
combinations (y + V*) and i (y — y*). 


This is nicely illustrated by an applet at the Web site http://thorin.adne.com/~topquark/ 
quantum/deepwellmain.html. 
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(c) If V(x) is an even function (that is, V(—x) = V(x)) then y (x) can always 
be taken to be either even or odd. Hint: If w(x) satisfies Equation 2.5, for 
a given E, so too does v(—x), and hence also the even and odd linear 
combinations y (x) + y(—x). 


xProblem 2.2 Show that E must exceed the minimum value of V (x), for every 
normalizable solution to the time-independent Schródinger equation. What is the 
classical analog to this statement? Hint: Rewrite Equation 2.5 in the form 


dw | 2m PP 
uam polis Ely: 


if E < Vmin, then y and its second derivative always have the same sign —argue 
that such a function cannot be normalized. 


2.2 THE INFINITE SQUARE WELL 


Suppose 
0. fO0<x <a, 
co. otherwise 


V(x)= | [2.19] 


(Figure 2.1). A particle in this potential is completely free, except at the two ends 
(x = 0 and x = a), where an infinite force prevents it from escaping. A classical 
model would be a cart on a frictionless horizontal air track, with perfectly elastic 
bumpers—-it just keeps bouncing back and forth forever. (This potential is artifi- 
cial, of course, but I urge you to treat it with respect. Despite its simplicity —or 
rather, precisely because of its simplicity—it serves as a wonderfully accessi- 
ble test case for all the fancy machinery that comes later. We’ll refer back to it 
frequently.) 


V(x) 


FIGURE 2.1: The infinite square well poten- 
a X tial (Equation 2.19). 
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Outside the well, (x) = O (the probability of finding the particle there is 
zero). Inside the well, where V = 0, the time-independent Schrödinger equation 
(Equation 2.5) reads 


h? d^y 
Za = Ey, 2.2 
2m dx? v [4:20] 
or : 
dà = —k^y. where k = E [2.21] 


(By writing it in this way, I have tacitly assumed that E > 0; we know from 
Problem 2.2 that E « 0 won't work.) Equation 2.21 is the classical simple har- 
monic oscillator equation; the general solution is 


V (x) = Asinkx + B coskx, [2.22] 


where A and B are arbitrary constants. Typically, these constants are fixed by the 
boundary conditions of the problem. What are the appropriate boundary con- 
ditions for v (x)? Ordinarily, both y and dy/dx are continuous, but where the 
potential goes to infinity only the first of these applies. (I'll prove these boundary 
conditions, and account for the exception when V — oo, in Section 2.5; for now I 
hope you will trust me.) 

Continuity of w(x) requires that 


y(0) = V (a) — 0. [2.23] 


so as to join onto the solution outside the well. What does this tell us about A and 
B? Well, 


y (0) = A sin0 + Bcos0 = B, 


so B = 0, and hence 
w(x) = Asinkx. [2.24] 


Then w(a) = Asinka, so either A = 0 (in which case we’re left with the triv- 
ial—non-normalizable—solution w(x) = 0), or else sinka = 0, which means 
that 

ka =0, tx, £2n, t3m, ... [2.25] 


But k = 0 is no good (again, that would imply w(x) = 0), and the negative 
solutions give nothing new, since sin(—@) = —sin(@) and we can absorb the 


minus sign into A. So the distinct solutions are 


ic = Hite es] 9.35 22 [2.26] 


32 


Chapter2 Time-Independent Schrödinger Equation 


y(x) Wo(X) W(X) 


FIGURE 2.2: The first three stationary states of the infinite square well (Equation 2.28). 


Curiously, the boundary condition at x = a does not determine the constant 
A, but rather the constant k, and hence the possible values of E: 


[2.27] 


In radical contrast to the classical case, a quantum particle in the infinite square 
well cannot have just any old energy—it has to be one of these special allowed 
values.) To find A, we normalize y: 


EIE 2a 4 2 
Í |A|- sin“ (kx) dx = ADS zl, so [Al = —. 
0 a 


This only determines the magnitude of A, but it is simplest to pick the positive real 
root: A = 4/2/a (the phase of A carries no physical significance anyway). Inside 
the well, then, the solutions are 


[2.28] 


As promised, the time-independent Schródinger equation has delivered an 
infinite set of solutions (one for each positive integer n). The first few of these are 
plotted in Figure 2.2. They look just like the standing waves on a string of length a; 
V1, which carries the lowest energy, is called the ground state, the others, whose 
energies increase in proportion to n’, are called excited states. As a collection, the 
functions V;(x) have some interesting and important properties: 

1. They are alternately even and odd, with respect to the center of the well: 
V is even, y» is odd, y3 is even, and so on.? 


8Notice that the quantization of energy emerged as a rather technical consequence of the bound- 
ary conditions on solutions to the time-independent Schródinger equation. 


°To make this symmetry more apparent, some authors center the well at the origin (running it 
from —a to +a). The even functions are then cosines. and the odd ones are sines. See Problem 2.36. 
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2. As you go up in energy, each successive state has one more node (zero- 
crossing): y; has none (the end points don't count), y has one, y3 has two, and 
so on. 

3. They are mutually orthogonal, in the sense that 


J Vin (x)* Vr, (x) dx 2 0, [2.29] 


whenever m Æ n. Proof: 
2 T4 ] 
J Vs GO Va (x) dx = E f sin (=x) sin (=) dx 
a Jo a a 
1 f | (* —n ) (* +n ) 
— COS zx|-—cos| ——zx]|dx 
a Jo a l a 
1 . (m-—n ] . (m+n 
= 4————— sin | —— rx | — ————— sin mx 
(m — n)x a (m+ n)x a 


zd [m snel _ sin[(m + n)z] | a 
EE (m — n) (m+n) g 


a 


0 


Note that this argument does not work if m = n. (Can you spot the point at which 
it fails?) In that case normalization tells us that the integral is 1. In fact, we can 
combine orthogonality and normalization into a single statement: !0 


J Vn QO* Vn (x) dx = pn. [2.30] 


where ômn (the so-called Kronecker delta) is defined in the usual way, 


E | 0, ifmÆn; [2.31] 


l. ifm =n. 


We say that the y `s are orthonormal. 
4. They are complete, in the sense that any other function, f(x), can be 
expressed as a linear combination of them: 


fe) 2 Y edo - je doen sin (x). [2.32] 


n=] n=l 


'0Tn this case the V''sare real. so the * on Y is unnecessary. but for future. purposes it's a good 
idea to get in the habit of putting it there. 
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I'm not about to prove the completeness of the functions sin (7t x /a), but if you've 
studied advanced calculus you will recognize that Equation 2.32 is nothing but the 
Fourier series for f(x), and the fact that "any" function can be expanded in this 
way is sometimes called Dirichlet’s theorem. !! 

The coefficients c,, can be evaluated —for a given f(x)—by a method I call 
Fourier's trick, which beautifully exploits the orthonormality of {y,,}: Multiply 
both sides of Equation 2.32 by y; (x)*. and integrate. 


J Vm (x)* f (x) dx — a I Vm CO* Vn (x)dx = X CnOmn = Cm. [2.33] 


n-i n=l 


(Notice how the Kronecker delta kills every term in the sum except the one for 
which n = m.) Thus the nth coefficient in the expansion of f(x) is!? 


Cn = J Wn (x)* f(x) dx. [2.34] 


These four properties are extremely powerful, and they are not peculiar to the 
infinite square well. The first is true whenever the potential itself is a symmetric 
function; the second is universal, regardless of the shape of the potential.!3 Orthog- 
onality is also quite general —I'll show you the proof in Chapter 3. Completeness 
holds for all the potentials you are likely to encounter, but the proofs tend to be 
nasty and laborious; I’m afraid most physicists simply assume completeness, and 
hope for the best. 

The stationary states (Equation 2.18) of the infinite square well are evidently 


2 tyler? 2 
W(x, t) = 2 sin (— ) e z-h/2ma Me [2.35] 


I claimed (Equation 2.17) that the most general solution to the (time-dependent) 
Schródinger equation is a linear combination of stationary states: 


oo 
2. /nn EU 2 
V(x. t) = Yen)? sin (=x) [a Lr n h/2ma n [2.36] 


n=1 


!! See, for example, Mary Boas, Mathematical Methods in the Physical Sciences, 2d ed. (New 
York: John Wiley, 1983), p. 313: f(x) can even have a finite number of finite discontinuities. 

!?Tt doesn't matter whether you use m or n as the "dummy index" here (as long as you are 
consistent on the two sides of the equation. of course): whatever letter you usc. it just stands for “any 
positive integer." 

H3See. for example, John L. Powell and Bernd Crasemann, Quantum Mechanics (Addison- 
Wesley, Reading. MA, 1961). p. 126. 
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(If you doubt that this is a solution, by all means check it!) It remains only for 
me to demonstrate that I can fit any prescribed initial wave function, V (x. 0), by 
appropriate choice of the coefficients c,: 


V(x,.0)— > CnWn(X). 
n=1 


The completeness of the w’s (confirmed in this case by Dirichlet’s theorem) guar- 
antees that I can always express Y(x.0) in this way, and their orthonormality 
licenses the use of Fourier’s trick to determine the actual coefficients: 


2 a 
e =f — | sin (=x) W(x. 0) dx. [2.37] 
a Jo a 


That does it: Given the initial wave function, V (x. 0), we first compute the 
expansion coefficients c, , using Equation 2.37, and then plug these into Equation 2.36 
to obtain V (x. t). Armed with the wave function, we are in a position to compute any 
dynamical quantities of interest, using the procedures in Chapter 1. And this same 
ritual applies to any potential —the only things that change are the functional form 
of the w’s and the equation for the allowed energies. 


Example 2.2 A particle in the infinite square well has the initial wave function 
V(x.0)—Ax(a—-x). (O<x <a). 


for some constant A (see Figure 2.3). Outside the well, of course, V = 0. Find 
V (x. t). 


Y (x, 0) 


a X 


FIGURE 2.3: The starting wave function in Example 2.2. 
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Solution: First we need to determine A, by normalizing W(x. 0): 


ker | (x. 0)|? dx = |A|* 2 X ?(a — x)? ax = AP. 


SO 


The mth coefficient is (Equation 2.37) 


Gi = Eo Hn ie 
a 


2. 15 : ( a " R " y) ax T (= ) 
= all —) sin( —x) — —cos{—vx 
a? na a nz a 
an? . nm (nzx/a) —2 na 
—|2(—) xsin Ty) TI oos Tx) 
| = iis e : (nz/a)? » s : | 


w] a? (nx x)? — 


— — cos(nz) +a 
nz 


2 
"mS otn La? aay T 
= 
(nx 


0. if n is even. 
8J/15/(nz)). if n is odd. 


3 [cos(0) — cos(nz )] 


Thus (Equation 2.36): 


n 3 
V (x i= 30 (=) i sin (=x) pain? a?hħt/2ma? 
xX. á 3 P J : 
nz].3.8.. 


Loosely speaking, c, tells you the "amount of y, that is contained in Y.” 
Some people like to say that |c,|? is the "probability of finding the particle in the 
nth stationary state," but this is bad language; the particle is in the state V, not 
V, , and, anyhow, in the laboratory you don't "find a particle to be in a particular 
state"—you measure some observable, and what you get is a number. As we'll 
see in Chapter 3, what |c„|? tells you is the probability that a measurement of the 
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energy would yield the value E,, (a competent measurement will always return one 
of the “allowed” values—hence the name—and |c, |? is the probability of getting 
the particular value E,). 

Of course, the sum of these probabilities should be 1, 


[2.38] 


Indeed, this follows from the normalization of W (the c,,’s are independent of time, 
so I’m going to do the proof for t = 0; if this bothers you, you can easily generalize 
the argument to arbitrary f). 


1- J IY (x, o) dx = J p 22! (Xen) dx 


n=] n=l 
co oc 


= 3563 js | into V, (x) dx 


ESTEE 
0 00 
= ) 2: C5, Cn8 mn = = 3 ler 
n=l] m=] n=! 


(Again, the Kronecker delta picks out the term m =n in the summation over m.) 
Moreover, the expectation value of the energy must be 


» 
(H) = 3 leal En. [2.39] 


n=] 


and this too can be checked directly: The time-independent Schródinger equation 
(Equation 2.12) says 
Hy, = Ey. [2.40] 


SO 


(H) — J W* HW dx = J (S aa) H (o) dx 
= 2 D oon j V Wn dx — y» Icy |? En. 


Notice that the probability of getting a particular energy is independent of time, and 
so, a fortiori, is the expectation value of H. This is a manifestation of conservation 
of energy in quantum mechanics. 
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Example 2.3 In Example 2.2 the starting wave function (Figure 2.3) closely re- 
sembles the ground state yı (Figure 2.2). This suggests that |c; | should dominate, 


and in fact 
2 
Icy | xi 


The rest of the coefficients make up the difference: 


oo 2 oc 
2 84A 15 l 
5 len |? = ( EE. 2 noo 


n=] 1.3.5.... 


5 
8Vi5\ 

: | = 0.998555.... 
Iv: 


l4 


The expectation value of the energy, in this example, is 


oc 2 7 939 oc "$ 
8./15\ n2x2h? 4800h? 1 5h? 
(H) = > E E n epi cum > Em 


3.3 CEU er EE 2 
new 2ma xma ma 
n=1.3.5.... n-].3.5.... 


As one might expect, it is very close to Ej = z2n?/2ma? — slightly larger, because 
of the admixture of excited states. 


Problem 2.3 Show that there is no acceptable solution to the (time-independent) 
Schrödinger equation for the infinite square well with E = 0 or E < 0. (This isa 
special case of the general theorem in Problem 2.2, but this time do it by explicitly 
solving the Schródinger equation, and showing that you cannot meet the boundary 
conditions.) 


Problem 2.4 Calculate (x), (x?), (p). (p?), ox, and Op, for the nth stationary state 
of the infinite square well. Check that the uncertainty principle is satisfied. Which 
state comes closest to the uncertainty limit? 


xProblem 2.5 A particle in the infinite square well has as its initial wave function 
an even mixture of the first two stationary states: 


V (x. 0) = A[V1GO + V2()]. 


l^ You can look up the series 


16 *36 * 56 * 7 = 960 
and 

La zd usns 

It sx s ^. 96 


in math tables. under "Sums of Reciprocal Powers" or "Riemann Zeta Function." 
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(a) Normalize Y (x, 0). (That is, find A. This is very easy, if you exploit the 
orthonormality of y and y». Recall that, having normalized V at t = 0, 
you can rest assured that it stays normalized —if you doubt this, check it 
explicitly after doing part (b).) 


(b) Find W(x, ¢) and |W(x, t)i. Express the latter as a sinusoidal function of 
time, as in Example 2.1. To simplify the result, let w = z?h/2ma?. 


(c) Compute (x). Notice that it oscillates in time. What is the angular frequency 
of the oscillation? What is the amplitude of the oscillation? (If your amplitude 
is greater than a/2, go directly to jail.) 


(d) Compute (p). (As Peter Lorre would say, “Do it ze kveek vay, Johnny") 


(e) If you measured the energy of this particle, what values might you get, and 
what is the probability of getting each of them? Find the expectation value 
of H. How does it compare with E; and E2? 


Problem 2.6 Although the overall phase constant of the wave function is of no 
physical significance (it cancels out whenever you calculate a measurable quantity), 
the relative phase of the coefficients in Equation 2.17 does matter. For example, 
suppose we change the relative phase of y, and y» in Problem 2.5: 


V(x.0) = Aly (x) e? ya (x)]. 


where @ is some constant. Find V(x, t), |Y (x, Dp, and (x), and compare your 
results with what you got before. Study the special cases @ = x/2 and $ = v. 
(For a graphical exploration of this problem see the applet in footnote 7.) 


xProblem 2.7 A particle in the infinite square well has the initial wave function? 


Ax. 0 <x <a/2, 
Ala— x). a/2<x <a. 


y(x. 0) = | 


(a) Sketch ¥(x. 0), and determine the constant A. 
(b) Find W(x. f). 


'SThere is no restriction in principle on the shape of the starting wave function. as long 
as it is normalizable. In particular, V(x.0) need not have a continuous derivative—in fact, it 
doesn't even have to be a continuous function. However. if you uy lo calculate (Æ) using 
f 9 G.0)* Hr (x, 0) dx in such a case, you may encounter technical difficulties, because the second 
derivative of V (x. 0) is ill-defined. It works in Problem 2.9 because the discontinuities occur at the end 
points, where the wave function is zero anyway. In Problem 2.48 you'll see how to manage cases like 
Problem 2.7. 
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(c) What is the probability that a measurement of the energy would yield the 
value Ej? 


(d) Find the expectation value of the energy. 


Problem 2.8 A particle of mass m in the infinite square well (of width a) starts 
out in the left half of the well, and is (at t = 0) equally likely to be found at any 
point in that region. 


(a) What is its initial wave function, Y (x, 0)? (Assume it is real. Don't forget 
to normalize it.) 


(b) What is the probability that a measurement of the energy would yield the 
value z?h? /2ma?? 


Problem 2.9 For the wave function in Example 2.2, find the expectation value of 
H, at time t = 0, the “old fashioned" way: 


(H) = [ vo. ive. dx. 


Compare the result obtained in Example 2.3. Note: Because (H) is independent of 
time, there is no loss of generality in using t = 0. 


2.3 THE HARMONIC OSCILLATOR 
The paradigm for a classical harmonic oscillator is a mass mw attached to a spring 
of force constant k. The motion is governed by Hooke's law, 


2 


F = —kx =m—> 
X m dr? 


(ignoring friction), and the solution is 


x(t) = A sin(@t) + B cos(or), 


w= = [2.41] 
m. 


is the (angular) frequency of oscillation. The potential energy is 


where 


1 
V(x) = jh [2.42] 


its graph is a parabola. 
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FIGURE 2.4: Parabolic approximation (dashed curve) to an arbitrary potential, in 
the neighborhood of a local minimum. 


Of course, there’s no such thing as a perfect harmonic oscillator—if you 
stretch it too far the spring is going to break, and typically Hooke’s law fails 
long before that point is reached. But practically any potential is approximately 
parabolic, in the neighborhood of a local minimum (Figure 2.4). Formally, if we 
expand V(x) in a Taylor series about the minimum: 


l 
V(x) = V (xo) + V'(xo)(x — xo) + 5 V" (xo) (x E e qeu 


subtract V (xo) (you can add a constant to V(x) with impunity, since that doesn't 
change the force), recognize that V’(xq) = 0 (since xo is a minimum), and drop the 
higher-order terms (which are negligible as long as (x — xo) stays small), we get 


1 5 
V(x) = 5 V0) — xo)". 


which describes simple harmonic oscillation (about the point xo), with an effective 
spring constant k = V"(xg).'6 That's why the simple harmonic oscillator is so 
important: Virtually any oscillatory motion is approximately simple harmonic, as 
long as the amplitude is small. 

The quantum problem is to solve the Schródinger equation for the potential 


l 
V(x) = zmax? [2.43] 


(it is customary to eliminate the spring constant in favor of the classical frequency, 
using Equation 2.41). As we have seen, it suffices to solve the time-independent 
Schrödinger equation: 

h dy |l 


Eco d jme x^y = Ey. [2.44] 


l6Note that V"(xg) > 0. since by assumption xg is a minimum. Only in the rare case V” (xy) = 0 
is the oscillation not even approximately simple harmonic. 
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In the literature you will find two entirely different approaches to this problem. 
The first is a straightforward “brute force” solution to the differential equation, 
using the power series method; it has the virtue that the same strategy can be 
applied to many other potentials (in fact, we'll use it in Chapter 4 to treat the 
Coulomb potential). The second is a diabolically clever algebraic technique, using 
so-called ladder operators. I’ll show you the algebraic method first, because it is 
quicker and simpler (and a lot more fun);!? if you want to skip the power series 
method for now, that's fine, but you should certainly plan to study it at some 
stage. 


2.3.1 Algebraic Method 
To begin with, let's rewrite Equation 2.44 in a more suggestive form: 


l5 o = | 
2, P + nox))y = Ey. [2.45] 


where p = (h/i)d/dx is, of course, the momentum operator. The basic idea is to 


factor the Hamiltonian, 


1 
H = —[p? + (mox)’]. [2.46] 
2m 
If these were numbers, it would be easy: 
u? +v = (iu +v)(—iu + v). 


Here, however, it’s not quite so simple, because p and x are operators, and oper- 
ators do not, in general, commute (xp is not the same as px). Still, this does 
motivate us to examine the quantities 


1 


Sinn (Fip + mox) [2.47] 


(the factor in front is just there to make the final results look nicer). 
Well, what is the product a_a+? 


l 
4-4 = ——— (ip + mwx)(—Ip + mox) 
2hmw 


l 
= [p? + (nex)? — imw(xp — px)]. 
2hinw 


'7We'll encounter some of the same strategies in the theory of angular momentum (Chapter 4), 
and the technique generalizes to a broad class of potentials in super-symmetric quantum mechanics 
(sce. for example. Richard W. Robinett, Quantum Mechanics. (Oxford U.P., New York, 1997). Section 
14.4). 


Section 2.3: The Harmonic Oscillator 43 


As anticipated, there’s an extra term, involving (xp — px). We call this the com- 
mutator of x and p; it is a measure of how badly they fail to commute. In general, 
the commutator of operators A and B (written with square brackets) is 


[A,B] = AB — BA. [2.48] 


In this notation, 


aay [p? + (mox)?] — ji p]. [2.49] 


= 2hmo 


We need to figure out the commutator of x and p. Warning: Operators are 
notoriously slippery to work with in the abstract, and you are bound to make 
mistakes unless you give them a “test function," f(x), to act on. At the end you 
can throw away the test function, and you'll be left with an equation involving the 
operators alone. In the present case we have: 


Js 
dx 


h d hd 
[x. p]f (x) = EO — ron == T 


l 


= f) = ih f(x). 
[2.50] 
Dropping the test function, which has served its purpose, 


This lovely and ubiquitous result is known as the canonical commutation rela- 
tion.!? 
With this, Equation 2.49 becomes 


l 1 
a4 =>—H+-. 2.52 
a-a} = 7 H+ 5 [2.52] 

or i 
H = ħw (o-a — >) . [2.53] 


Evidently the Hamiltonian does nof factor perfectly—there's that extra —1/2 on the 
right. Notice that the ordering of a+ and a_ is important here; the same argument, 
with a+ on the left, yields 


~=—H--. 2.54 
aa ho 2 [ ] 

In particular, 
[a-, a4] = I. [2.55] 


'81n a deep sense all of the mysteries of quantum mechanics can be traced to the fact that position 
and momentum do not commute. Indeed, some authors take the canonical commutation relation as an 
axiom of the theory. and use it to derive p = (h/i)d/dx. 
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So the Hamiltonian can equally well be written 


H — ho (aa + z) ; [2.56] 


In terms of a+, then, the Schrödinger equation!? for the harmonic oscillator takes 
the form 


hw (asas Ea ;) y = Ey [2.57] 


(in equations like this you read the upper signs all the way across, or else the lower 
signs). 

Now, here comes the crucial step: I claim that if y satisfies the Schrödinger 
equation with energy E, (that is: Hy = Ey), then ag. satisfies the Schrödinger 
equation with energy (E + hw): H(azw) = (E + ħw)(a+y). Proof: 


H (ay) = ho (ara + ;) (a. V) = hio c + je) Vy 


= hoa, (o-a + 5) y =a |o (ca Ts >) J 
=a4(H + ho)y =a4(E + ho)y = (E + ho)(a4 y). 


(I used Equation 2.55 to replace a_ay by a+a— + 1, in the second line. Notice 
that whereas the ordering of a; and a_ does matter, the ordering of a+ and 
any constants—such as Ai, w, and E—does not; an operator commutes with any 
constant.) 

By the same token, a_y is a solution with energy (E — fiw): 


H(a_yp) = ho (a-a = ;) (a-y) = hoa- (a - ;) y 


ed | no (oa —]- 5) «| = a_(H — ho)y — a. (E — ho)y 
= (E = ho)(a- yr). 


Here, then, is a wonderful machine for generating new solutions, with higher and 
lower energies—if we could just find one solution, to get started! We call a+ 
ladder operators, because they allow us to climb up and down in energy; a+ is 
the raising operator, and a_ the lowering operator. The “ladder” of states is 
illustrated in Figure 2.5. 


I9T'm getting tired of writing “time-independent Schrödinger equation," so when it's clear from 
the context which one I mean, I'll just call it the “Schrödinger equation.” 
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FIGURE 2.5: The “ladder” of states for the harmonic oscillator. 


But wait! What if I apply the lowering operator repeatedly? Eventually I’m 
going to reach a state with energy less than zero, which (according to the general 
theorem in Problem 2.2) does not exist! At some point the machine must fail. 
How can that happen? We know that a_y is a new solution to the Schrödinger 
equation, but there is no guarantee that it will be normalizable —it might be zero, 
or its square-integral might be infinite. In practice it is the former: There occurs a 
“lowest rung" (call it y/o) such that 


a. o = 0. [2.58] 


We can use this to determine wo(x): 


1 d 
JV 2hmo ( dx 


46 


Chapter 2 Time-Independent Schrödinger Equation 


Or 
dg mo 
ue cs ED 
This differential equation is easy to solve: 
do — | mo mw 
— xdx => Ihw=-— constant, 
do "E X Vo = Ph x? + constan 
SO 
Vo) = Ae SP. 
We might as well normalize it right away: 
g "ex^ Ji zh 
I = JA]? dx = |A|? 
mw 


so A? = /mw/mh, and hence 


[2.59] 


To determine the energy of this state we plug it into the Schródinger equation (in 
the form of Equation 2.57), lie (a.a. + 1/2) yo = Eowo, and exploit the fact that 
ao —0 


With our foot now securely planted on the bottom rung (the ground state of the 
quantum oscillator), we simply apply the raising operator (repeatedly) to generate 
the excited states," increasing the energy by fw with each step: 


Va(x) = An (a+)" Vo(x). with E, = ¢ T 5) ho, [2.61] 


where A, is the normalization constant. By applying the raising operator (repeat- 
edly) to Wo, then, we can (in principle) construct all?! the stationary states of 


?0In the case of the harmonic oscillator it is customary. for some reason, to depart from the usual 
ractice, and number the states starting with n = 0, instead of n = 1. Obviously, the lower limit on the 
practi l & : y 
sum in a formula such as Equation 2.17 should be altered accordingly. 


?! Note that we obtain all the (normalizable) solutions by this procedure. For if there were some 
other solution. we could generate from it a second ladder. by repeated application of the raising and 
lowering operators. But the bottom rung of this new ladder would have to satisfy Equation 2.58, and 
since that leads inexorably to Equation 2.59, the bottom rungs would be the same, and hence the two 
ladders would in fact be identical. 
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the harmonic oscillator. Meanwhile, without ever doing that explicitly, we have 


determined the allowed energies. 


Example 2.4 Find the first excited state of the harmonic oscillator. 


Solution: Using Equation 2.61, 


Ái d mo^ ^ oma 2 
(x) 2 Aja —-————[-h— + max | | — e Mn* 
Vi ) I 4- Vo Jalimo ( dx ) (= ) 


A mw 1/4 2mw -uo y? 
= A; | — xe 3^, 
zh h 


We can normalize it “by hand": 


2 oo mw 2 
[testa —( =e) f a EA 


[2.62] 


xA \ h Jl] 


so, as it happens, A; = 1. 

I wouldn't want to calculate W5q this way (applying the raising operator fifty 
times!), but never mind: In principle Equation 2.61 does the job— except for the 
normalization. 


You can even get the normalization algebraically, but it takes some fancy 
footwork, so watch closely. We know that a+ y, is proportional to V, 41, 


a4Wn =CnWnei, a-WYn = dnr Yn- [2.63] 


but what are the proportionality factors, c, and d,? First note that for “any” 22 


functions f(x) and g(x), 


oo oo 
J f*(a+8)dx = Í (az f)*g dx. [2.64] 
"o en 


(In the language of linear algebra, a4 is the hermitian conjugate of a+.) 
Proof: 


= * d D k et d 
HX EUN = -5 —— mox b 
[if (a4 g) |.) Gas m Je X, 


22Of course, the integrals must exist, and this means that f(x) and g(x) must go to zero at 
too. 
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and integration by parts takes f f*(dg/dx) dx to — f (df/dx)*g dx (the boundary 
terms vanish, for the reason indicated in footnote 22), so 


iz i" (adeo z= | NIS n+ max) f| edx= f (ax f)*g dx. 


QED 
In particular, 


J (a+ Wn) (at Wn) dx = J (day Va)" Vn dx. 


—oo0 =00 


But (invoking Equations 2.57 and 2.61) 
Gd, — nya. d-ad V, — (n+ Dn. [2.65] 


SO 


i (a+ Yn)" (as) dx = jal? f (Wael? dx = (n + bf Ius P dx. 


> (a_Wn) )'(a-y,) dx = dul 2 IVa-il^ Manon" lys |? dx. 


But since y, and y, +) are normalized, it follows that lc? — n-F 1 and |d,|? =n, 
and hence 


Gr, =VN+1 Wnt, a-n = nya. [2.66] 
Thus 
Vr aude. dio ayi = (i) 
1 = d4 Vo. mw ud RW os 0: 
UE MM NER SR Y E 
ya = par 439 Vo. V4 quum was Vo. 


and so on. Clearly 


[2.67] 


which is to say that the normalization factor in Equation 2.61 is An = 1/4/n! (in 
particular, Aj = 1, confirming our result in Example 2.4). 
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As in the case of the infinite square well, the stationary states of the harmonic 


oscillator are orthogonal: 
co 


Vin Wn dX = ômn- [2.68] 


This can be proved using Equation 2.65, and Equation 2.64 twice—first moving 
a+ and then moving a_: 


2i oo 
J V (a.a -)y, dx =n J V V, dx 
BS e 
e oo 
~ J (a—Wm)*(a-Wn) dx = / (aya_ Vm) Wn dx 
Em s 


oo 
=m J V Vs dx. 
-—O0 


Unless m = n, then, f Ww, dx must be zero. Orthonormality means that we 
can again use Fourier's trick (Equation 2.34) to evaluate the coefficients, when we 
expand WV(x,0) as a linear combination of stationary states (Equation 2.16), and 
lcn|* is again the probability that a measurement of the energy would yield the 
value E,. 


Example 2.5 Find the expectation value of the potential energy in the nth state 
of the harmonic oscillator. 


Solution: 
l 22 MY iT. 
(V) = jme x = me " X^ Vn dx. 
—oc 


There's a beautiful device for evaluating integrals of this kind (involving powers 


of x or p): Use the definition (Equation 2.47) to express x and p in terms of the 
raising and lowering operators: 


| h |h 
y= S On +a-): pat P (ay —a_). [2.69] 
2mo 2 


In this example we are interested in x?: 
2_ A 2 2 
x? = =| (ay)? + (aga) + (a-a) + G- |. 
2mw 


So 


(V) = - J Vs [(a4)? + (aya) + (aay) + (| Wn dx. 
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But (a4)? is (apart from normalization) 442, which is orthogonal to Yn, and 
the same goes for (a)? Wn, which is proportional to V, 3. So those terms drop 
out, and we can usc Equation 2.65 to evaluate the remaining two: 


ho 


l ] 
y= " (n-ntfb- jon 7). 


As it happens, the expectation value of the potential energy is exactly half the 
total (the other half, of course, is kinetic). This is a peculiarity of the harmonic 
oscillator, as we'll see later on. 


xProblem 2.10 
(a) Construct W(x). 
(b) Sketch Wo, Wi, and yo. 


(c) Check the orthogonality of wo, Wi, and yo, by explicit integration. Hint: If 
you exploit the even-ness and odd-ness of the functions, there is really only 
one integral left to do. 


xProblem 2.11 


(a) Compute (x), (p). (x?), and (p?), for the states Wo (Equation 2.59) and y 
(Equation 2.62), by explicit integration. Comment: In this and other problems 
involving the harmonic oscillator it simplifies matters if you introduce the 
variable £ = J/mco/Ri x and the constant a = (mw/mh)!/4. 


(b) Check the uncertainty principle for these states. 


(c) Compute (T) (the average kinetic energy) and (V) (the average potential 
energy) for these states. (No new integration allowed!) Is their sum what you 
would expect? 


xProblem 2.12 Find (x), (p), (x?), (p?), and (T), for the nth stationary state of the 
harmonic oscillator, using the method of Example 2.5. Check that the uncertainty 
principle is satisfied. 


Problem 2.13 A particle in the harmonic oscillator potential starts out in the state 
V(x.0) = A[3wo(x) + 4v 1(x)]. 


(a) Find A. 
(b) Construct V(x, r) and |Y (x. 0) ?. 
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(c) Find (x) and (p). Don’t get too excited if they oscillate at the classical 
frequency; what would it have been had I specified yr? (x), instead of yr (x)? 
Check that Ehrenfest' s theorem (Equation 1.38) holds for this wave function. 


(d) If you measured the energy of this particle, what values might you get, and 
with what probabilities? 


Problem 2.14 A particle is in the ground state of the harmonic oscillator with 
classical frequency w, when suddenly the spring constant quadruples, so w = 2w, 
without initially changing the wave function (of course, V will now evolve differ- 
ently, because the Hamiltonian has changed). What is the probability that a mea- 
surement of the energy would still return the value fiw/2? What is the probability 
of getting iw? [Answer: 0.943.] 


2.3.2. Analytic Method 


We return now to the Schródinger equation for the harmonic oscillator, 
aa kee as E 
-—— +- ow = Ey. 2.70 
2m dx? r DRR y y Hl 


and solve it directly, by the series method. Things look a little cleaner if we 
introduce the dimensionless variable 


REN [2.71] 


in terms of & the Schródinger equation reads 


ay 5 
= — K)y, 2.72 
ger = PK [2.72] 
where K is the energy, in units of (1/2)fiw: 
2E 
K = —. [2.73] 
ħw 


Our problem is to solve Equation 2.72, and in the process obtain the “allowed” 
values of K (and hence of E). 

To begin with, note that at very large £ (which is to say, at very large x), £? 
completely dominates over the constant K, so in this regime 


d^ os 
de zx E^ yr, [2.74] 


which has the approximate solution (check it!) 


YE) & Ae € 4. ge? [2.75] 
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The B term is clearly not normalizable (it blows up as |x| — oo); the physically 
acceptable solutions, then, have the asymptotic form 


WE) > Coe, at large €. [2.76] 
This suggests that we "peel off" the exponential part, 
WE) = he, [2.77] 


in hopes that what remains, A (E), has a simpler functional form than y (5) itself.?? 
Differentiating Equation 2.77, 


ey = & — sn) etn, 


d£ dé 
and 
aA & cM ere bi) fein 
d£? d£? dé l 
so the Schrödinger equation (Equation 2.72) becomes 
2 
Tg AHK- Dh = 0. [2.78] 
I propose to look for solutions to Equation 2.78 in the form of power series 
in gn 
oo 
h(E) San tag +a +---= I atl. [2.79] 
j=0 


Differentiating the series term by term, 


dh = 
dE = ai + 2a5£ +3036? +++. = 2 jaji, 
and 
d?h 2 E ak j 
d£? = 2a) + 2-3a3€ +3- Aa4£* +- = DOG + DG 4-283. 


j-0 


*3Note that although we invoked some approximations to motivate Equation 2.77. what fol- 
lows is exact. The device of stripping off the asymptotic behavior is the standard first step in 
the power series method for solving differential equations— see, for example. Boas (footnote 11). 
Chapter 12. 

"This is known as the Frobenius method for solving a differential equation. According to 
Taylor's theorem. any reasonably well-behaved function can be expressed as a power scries, so 
Equation 2.79 ordinarily involves no loss of gencrality. For conditions on the applicability of the 
method. see Boas (footnote 11) or George B. Arfken and Hans-Jurgen Weber, Mathematical Methods 
for Physicists. 5th cd., Academic Press. Orlando (2000). Section 8.5. 
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Putting these into Equation 2.78, we find 


Y [G + DG + 2ajaa — 2jaj +(K — 1)aj]é/ — 0. [2.80] 
j=0 


It follows (from the uniqueness of power series expansions?) that the coefficient 
of each power of E must vanish, 


G * DG +2)aj42 — 2jaj +(K — Daj —0, 


and hence that 


QjT-1-K) 
ajy = —— — ——— 


z = aj. 
(EDO +2) 
This recursion formula is entirely equivalent to the Schródinger equation. 
Starting with ag, it generates all the even-numbered coefficients: 


[2.81] 


(1— K) GK) (EMI —K) 
E MS ET er 24 201 


and starting with a), it generates the odd coefficients: 


a» = 


SGER QR). Ded E 
diee SS eS 120 ja 
We write the complete solution as 
h(E) = heven (5) + Road lE). [2.82] 


where 

heven(£) = a0 +a? + agé* + e 
is an even function of &, built on ag, and 

hoga(£) = ai + as£? -- as£? +- 


is an odd function, built on a;. Thus Equation 2.81 determines A(£) in terms of 
two arbitrary constants (ao and aj)— which is just what we would expect, for a 
second-order differential equation. 

However, not all the solutions so obtained are normalizable. For at very large 
j. the recursion formula becomes (approximately) 


N 


aj42 ~ -aj 
j43 a E 
J 


25Sec. for example. Arfken (footnote 24). Section 5.7. 
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with the (approximate) solution 
aj FR: E 
da 


for some constant C, and this yields (at large £, where the higher powers dominate) 


x E eja ey oe 
h(E)e c. TIS b n Ce. 
Now, if h goes like exp(£?), then y (remember y/?—that's what we're trying to 
calculate) goes like exp(£?/2) (Equation 2.77), which is precisely the asymptotic 
behavior we didn't want.” There is only one way to wiggle out of this: For 
normalizable solutions the power series must terminate. There must occur some 
"highest" j (call it 1), such that the recursion formula spits out a,42 = O (this will 
truncate either the series Meyen or the series //944; the other one must be zero from 
the start: a; = 0 if n is even, and ag = 0 if n is odd). For physically acceptable 
solutions, then, Equation 2.81 requires that 


K —2n 4 l. 


for some non-negative integer n, which is to say (referring to Equation 2.73) that 
the energy must be 


l 
En = ¢ + z) ho. forn =0.1,2..... [2.83] 


Thus we recover, by a completely different method, the fundamental quantization 
condition we found algebraically in Equation 2.61. 

It seems at first rather surprising that the quantization of energy should 
emerge from a technical detail in the power series solution to the Schrödinger 
equation, but let’s look at it from a different perspective. Equation 2.70 has 
solutions, of course, for any value of E (in fact, it has two linearly independent 
solutions for every E). But almost all of these solutions blow up exponentially at 
large x, and hence are not normalizable. Imagine, for example, using an E that 
is slightly less than one of the allowed values (say, 0.49/iw), and plotting the 
solution (Figure 2.6(a)); the “tails” fly off to infinity. Now try an E slightly larger 
(say, 0.51710); the “tails” now blow up in the other direction (Figure 2.6(b)). As 
you tweak the parameter in tiny increments from 0.49 to 0.51, the tails flip over 
when you pass through 0.5—only at precisely 0.5 do the tails go to zero, leaving 
a normalizable solution?" 


?6]r's no surprise that the ill-behaved solutions are still contained in Equation 2.81: this recursion 
relation is equivalent to the Schrödinger equation, so it's ger to include both the asymptotic forms we 
found in Equation 2.75. 

Fris possible to set this up on a computer. and discover the allowed energies “experimentally.” 


You might call it the wag the dog method: When the tail wags. you know you've just passed over an 
allowed value. See Problems 2.54-2.56. 
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FIGURE 2.6: Solutions to the Schródinger equation for (a) E = 0.49 fiw, and 
(b) E 2 0.51 ho. 


For the allowed values of K, the recursion formula reads 


—2(n — J) 


TEXTE A ES 


aj+2 = 


If n = 0, there is only one term in the series (we must pick a, = 0 to kill Noda, 
and j = 0 in Equation 2.84 yields a? = 0): 


hol) = ao. 


and hence 


Vo(£) = age * 7? 
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(which, apart from the normalization, reproduces Equation 2.59). For n = 1 we 
take ao = 0,78 and Equation 2.84 with j = 1 yields a3 = 0, so 


hy(é)=aé. 


and hence 


pE) = aget. 


(confirming Equation 2.62). For n = 2, j = 0 yields az = —2ao, and j = 2 gives 
a4 = 0, SO 


hy(£) = ag(1 — 2£?). 


and 


Va(E) = ag(1 — 22e * 72, 


and so on. (Compare Problem 2.10, where this last result was obtained by algebraic 
means.) 

In general, h, (E) will be a polynomial of degree n in £, involving even powers 
only, if n is an even integer, and odd powers only, if n is an odd integer. Apart 
from the overall factor (ao or aj) they are the so-called Hermite polynomials, 
H, (£).? The first few of them are listed in Table 2.1. By tradition, the arbitrary 
multiplicative factor is chosen so that the coefficient of the highest power of & 
is 2". With this convention, the normalized?? stationary states for the harmonic 
oscillator are 


1/4 
mo ] E 
cT ETE zB 
Une) = (Z 2) yz; HO. [2.85] 


They are identical (of course) to the ones we obtained algebraically in Equation 2.67. 


TABLE 2.1: The first few Hermite 
polynomials, Hn (£). 


Ho=1, 
Hy, = 26, 
Hp = 4&2 - 2, 


H3 = 8&3 — 126, 
Hg = 16&4 — 48&? + 12, 
Hs = 328? — 1608? + 1208. 


28Note that there is a completely different set of coctficients a; for each value of n. 
pletely j 


The Hermite polynomials have been studied extensively in the mathematical literature, and 
there are many tools and tricks for working with them. A few of these are explored in Problem 2.17. 


30] shall not work out the normalization constant here: if you are interested in knowing how it is 
done, see for example Leonard Schiff, Quantum Mechanics, 3rd ed., McGraw-Hill, New York (1968). 
Section 13. 
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In Figure 2.7(a) I have plotted w,,(x) for the first few n’s. The quantum 
oscillator is strikingly different from its classical counterpart — not only are the 
energies quantized, but the position distributions have some bizarre features. For 
instance, the probability of finding the particle outside the classically allowed range 
(that is, with x greater than the classical amplitude for the energy in question) is 
not zero (see Problem 2.15), and in all odd states the probability of finding the 
particle at the center is zero. Only at large n do we begin to see some resemblance 
to the classical case. In Figure 2.7(b) I have superimposed the classical position 
distribution on the quantum one (for n = 100); if you smoothed out the bumps, 
the two would fit pretty well (however, in the classical case we are talking about 
the distribution of positions over time for one oscillator, whereas in the quantum 
case we are talking about the distribution over an ensemble of identically prepared 
systems).?! 


Problem 2.15 In the ground state of the harmonic oscillator, what is the probability 
(correct to three significant digits) of finding the particle outside the classically 
allowed region? Hint: Classically, the energy of an oscillator is E = (1/2)ka? = 
(1/2)mo?a?, where a is the amplitude. So the “classically allowed region" for an 
oscillator of energy E extends from —/2E /mo? to J-/2E / mo. Look in a math 
table under “Normal Distribution" or “Error Function" for the numerical value of 
the integral. 


Problem 2.16 Use the recursion formula (Equation 2.84) to work out Hs5(£) and 
Hq(&). Invoke the convention that the coefficient of the highest power of £ is 2" 
to fix the overall constant. 


x «Problem 2.17 In this problem we explore some of the more useful theorems (stated 
without proof) involving Hermite polynomials. 
(a) The Rodrigues formula says that 
s d n 2 
H,(E) = (—1)" eë (=) e. [2.86] 
Use it to derive H3 and H4. 


(b) The following recursion relation gives you Hy+ in terms of the two preced- 
ing Hermite polynomials: 


Hy (E) = 28 Ay (E) — 2n Hy, (8). [2.87] 


Use it, together with your answer in (a), to obtain H5 and Hg. 


3! The parallel is perhaps more direct if you interpret the classical distribution as an ensemble of 
oscillators all with the same energy. but with random starting times. 
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FIGURE 2.7: (a) The first four stationary states of the harmonic oscillator. This 
material is used by permission of John Wiley & Sons, Inc.; Stephen Gasiorowicz, 
Quantum Physics, john Wiley & Sons, Inc., 1974. (b) Graph of |¥199|7, with the 
classical distribution (dashed curve) superimposed. 
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(c) If you differentiate an nth-order polynomial, you get a polynomial of order 
(n — 1). For the Hermite polynomials, in fact, 


d Hy 
dé 
Check this, by differentiating H5 and Hg. 


— 2n H, (E). [2.88] 


(d) H,,(&) is the nth z-derivative, at z = 0, of the generating function exp(—z? + 
2z£); or, to put it another way, it is the coefficient of z"/n! in the Taylor 
series expansion for this function: 


co 


242: z 
e LUPO [2.89] 


n=0 


Use this to rederive Ho, Hj, and Ap. 


2.4 THE FREE PARTICLE 


We turn next to what should have been the simplest case of all: the free particle 
(V(x) = 0 everywhere). Classically this would just mean motion at constant veloc- 
ity, but in quantum mechanics the problem is surprisingly subtle and tricky. The 
time-independent Schrédinger equation reads 


h? dw 
or 
d? V2mE 
j y — Ky, where k = T [2.91] 
NT 1 


So far, it’s the same as inside the infinite square well (Equation 2.21), where the 
potential is also zero; this time, however, I prefer to write the general solution in 
exponential form (instead of sines and cosines), for reasons that will appear in due 
course: 

y (x) = Ae™ + Bei, [2.92] 


Unlike the infinite square well, there are no boundary conditions to restrict the 
possible values of k (and hence of E); the free particle can carry any (positive) 
energy. Tacking on the standard time dependence, exp(—iEt/h), 


TITIUS —ik(x4- fh ` 
Wx, t) = Aei^* 80 Be GELD [2.93] 


Now, any function of x and t that depends on these variables in the special 
combination (x = vt) (for some constant v) represents a wave of fixed profile, 
traveling in the +x-direction, at speed v. A fixed point on the waveform (for 
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example, a maximum or a minimum) corresponds to a fixed value of the argument, 
and hence to x and ¢ such that 


x + vt = constant. or x —-rvt + constant. 


Since every point on the waveform is moving along with the same velocity, its 
shape doesn’t change as it propagates. Thus the first term in Equation 2.93 repre- 
sents a wave traveling to the right, and the second represents a wave (of the same 
energy) going to the left. By the way, since they only differ by the sign in front of 
k, we might as well write 


V, Gr. t) = Ae C7 5, [2.94] 


and let k run negative to cover the case of waves traveling to the left: 


2 E . . . 
J2m 2 unm | k»0- traveling to the right, (2.95] 


k«0-2 traveling to the left. 


Evidently the “stationary states" of the free particle are propagating waves; their 
wavelength is A = 27r /|k|, and, according to the de Broglie formula (Equation 1.39), 
they carry momentum 

p — hk. [2.96] 


The speed of these waves (the coefficient of t over the coefficient of x) is 


h|k| | E 
Uquantum — 3m LI on [2.97] 


On the other hand, the classical speed of a free particle with energy E is given by 
E = (1/2)mv? (pure kinetic, since V = 0), so 


2E 


y eX 2 Uquantum- [2.98] 
n 


Uclassical = 
Apparently the quantum mechanical wave function travels at half the speed of the 
particle it is supposed to represent! We'll return to this paradox in a moment— there 
is an even more serious problem we need to confront first: This wave function is 
not normalizable. For 


+00 +o 
| WI, dx = iAP f dx = |A|? (00). [2.99] 
—O0 —00 

In the case of the free particle, then, the separable solutions do not represent 
physically realizable states. A free particle cannot exist in a stationary state; or, 
to put it another way, there is no such thing as a free particle with a definite 
energy. 
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But that doesn't mean the separable solutions are of no use to us, for they 
play a mathematical role that is entirely independent of their physical interpre- 
tation. The general solution to the time-dependent Schródinger equation is still a 
linear combination of separable solutions (only this time it’s an integral over the 
continuous variable k, instead of a sum over the discrete index 7): 


W(x,t) = (el 7 5D dk. [2.100] 


] Fog 
J/ 2x |. 


(The quantity 1/./27 is factored out for convenience; what plays the role of the 
coefficient c, in Equation 2.17 is the combination (1 /V21)o(k) dk.) Now this 
wave function can be normalized (for appropriate @(k)). But it necessarily carries 
a range of k’s, and hence a range of energies and speeds. We call it a wave 
packet.?? 

In the generic quantum problem, we are given W(x, 0), and we are asked to 
find V (x.t). For a free particle the solution takes the form of Equation 2.100; 
the only question is how to determine @(k) so as to match the initial wave 


| J/2n —00 


This is a classic problem in Fourier analysis; the answer is provided by Plancherel's 
theorem (see Problem 2.20): 


+00 . : 
f= Í 7 F(k)e™ dk e— F(k)= foe dx.| [2.102] 


l +00 
/20 J 


F (k) is called the Fourier transform of f(x); f (x) is the inverse Fourier trans- 
form of F(k) (the only difference is in the sign of the exponent). There is, of 
course, some restriction on the allowable functions: The integrals have to exist.?? 
For our purposes this is guaranteed by the physical requirement that V (x. 0) itself 


32 Sinusoidal waves extend out to infinity. and they are not normalizable. But superpositions of 
such waves lead to interference. which allows for localization and normalizability. 


3 The necessary and sufficient condition on f(x) is that JS | fx) dx be finite. (In that 


case T, |F (k)|"dk is also finite, and in fact the two integrals are equal.) See Arlken (footnote 24). 
Section 15.5. 
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be normalized. So the solution to the generic quantum problem, for the free particle, 
is Equation 2.100, with 


V (x. 0e dx. [2.103] 


"Zu. 


Example 2.6 A free particle, which is initially localized in the range —a < x <a, 
is released at time t = 0: 


A, if -a<x <a, 
W(x, 0) = | 0. otherwise, 


where A and a are positive real constants. Find W(x, t). 


Solution: First we need to normalize W(x. 0): 


eo a 
=f Y.Q dx =|AP ] dx 22a|A? > A=—=. 
= = V2a 


Next we calculate @(k), using Equation 2.103: 


—ikx 
e "d l e s 


iim ml. RE CE lea 


7 l ika _ ,—ika H l sin(ka) 
E 2777 2i ^na k ` 
Finally, we plug this back into Equation 2.100: 


OD RI hk? 


y(x. t) = =n dk. [2.104] 


xml 


Unfortunately, this integral cannot be solved in terms of elementary functions, 
though it can of course be evaluated numerically (Figure 2.8). (There are, in fact, 
precious few cases in which the integral for Y (x.t) (Equation 2.100) can be cal- 
culated explicitly; see Problem 2.22 for a particularly beautiful example.) 

It is illuminating to explore the limiting cases. If a is very small, the starting 
wave function is a nicely localized spike (Figure 2.9(a)). In this case we can use 
the small angle approximation to write sin(ka) ~% ka, and hence 


poa |S 
T 
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| wx, t)l? 


s 4 2 0 2 4 6 


FIGURE 2.8: Graph of |W(x, t)|? (Equation 2.104) at £ = 0 (the rectangle) and at 
t = ma? /h (the curve). 


(k) 


aln 


-ala X k 


(a) (b) 
FIGURE 2.9: Example 2.6, for small a. (a) Graph of ¥ (x, 0). (b) Graph of $ (k). 


it’s flat, since the k’s cancelled out (Figure 2.9(b)). This is an example of the 
uncertainty principle: If the spread in position is small, the spread in momentum 
(and hence in k— see Equation 2.96) must be large. At the other extreme (large 
a) the spread in position is broad (Figure 2.10(a)) and 


5 = [2 
x ka 


Now, sinz/z has its maximum at z = 0, and drops to zero at z = + 7 (which, in 
this context, means k = t 7/a). So for large a, @(k) is a sharp spike about k = 0 
(Figure 2.10(b)). This time it’s got a well-defined momentum but an ill-defined 
position. 
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Y(x, 0) 


(a) (b) 


FIGURE 2.10: Example 2.6, for large a. (a) Graph of W(x, 0). (b) Graph of $(k). 


I return now to the paradox noted earlier: the fact that the separable solution 
V.(x,f) in Equation 2.94 travels at the “wrong” speed for the particle it osten- 
sibly represents. Strictly speaking, the problem evaporated when we discovered 
that V, is not a physically realizable state. Nevertheless, it is of interest to dis- 
cover how information about velocity is contained in the free particle wave function 
(Equation 2.100). The essential idea is this: A wave packet is a superposition of 
sinusoidal functions whose amplitude is modulated by $ (Figure 2.11); it consists of 
"ripples" contained within an "envelope." What corresponds to the particle velocity 
is not the speed of the individual ripples (the so-called phase velocity), but rather 
the speed of the envelope (the group velocity) — which, depending on the nature 
of the waves, can be greater than, less than, or equal to, the velocity of the ripples 
that go to make it up. For waves on a string, the group velocity is the same as the 
phase velocity. For water waves it is one-half the phase velocity, as you may have 
noticed when you toss a rock into a pond (if you concentrate on a particular ripple, 
you will see it build up from the rear, move forward through the group, and fade 
away at the front, while the group as a whole propagates out at half the speed). What 
I need to show is that for the wave function of a free particle in quantum mechanics 


FIGURE 2.11: A wave packet. The “‘enve- 
lope” travels at the group velocity; the “rip- 
ples” travel at the phase velocity. 
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the group velocity is twice the phase velocity—yust right to represent the classical 
particle speed. 

The problem, then, is to determine the group velocity of a wave packet with 
the general form 


W(x,t) = (kjede) dk. 


l +00 
ml. 
(In our case w = (hk?/2m), but what I have to say now applies to any kind 
of wave packet, regardless of its dispersion relation—the formula for w as a 
function of k.) Let us assume that $(k) is narrowly peaked about some particular 
value ko. (There is nothing illegal about a broad spread in k, but such wave packets 
change shape rapidly— since different components travel at different speeds—so 
the whole notion of a "group," with a well-defined velocity, loses its meaning.) 
Since the integrand is negligible except in the vicinity of ko, we may as well 
Taylor-expand the function w(k) about that point, and keep only the leading terms: 


w(k) = wo + ex (k — ko). 


where wy is the derivative of w with respect to k, at the point ko. 
Changing variables from k to s = k — Ko (to center the integral at kg), we 
have 


] us : 
W(x, t) = — $ (kg + syetlhots)x—(on test] que 


V2 —oo0 
At f =0, 


va, 0) = —— ee +set qs 
icd «V 27 J—oo i l 


and at later times 
l3 (AQ fro ; ; 
V(x. t) = oe, o (ko M sje! Cuts) Greg) ds. 
T —co 


Except for the shift from x to (x — Wot), the integral is the same as the one in 
V (x, 0). Thus 
V (x, t) Zi eI Conor y (x — wht. 0). [2.105] 


Apart from the phase factor in front (which won't affect |W|* in any event) the 
wave packet evidently moves along at a speed w): 


do 
Ugroup — p [2.106] 


(evaluated at k — ko). This is to be contrasted with the ordinary phase velocity 


e 
Uphase — T . [2.107] 
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In our case, w = (hk?/2m), so w/k = (hk/2m), whereas dw/dk = (hk/m), which 
is twice as great. This confirms that it is the group velocity of the wave packet, 
not the phase velocity of the stationary states, that matches the classical. particle 
velocity: 

Uclassical = Ugroup = 2 Uphase- [2.108] 


Problem 2.18 Show that [Ae/** + Be-**] and [C cos kx + D sin kx] are equivalent 
ways of writing the same function of x, and determine the constants C and D in 
terms of A and B, and vice versa. Comment: In quantum mechanics, when V = 0, 
the exponentials represent traveling waves, and are most convenient in discussing 
the free particle, whereas sines and cosines correspond to standing waves, which 
arise naturally in the case of the infinite square well. 


Problem 2.19 Find the probability current, J (Problem 1.14) for the free particle 
wave function Equation 2.94. Which direction does the probability current flow? 


x xProblem 2.20 This problem is designed to guide you through a "proof" of Plan- 
cherel's theorem, by starting with the theory of ordinary Fourier series on a finite 
interval, and allowing that interval to expand to infinity. 


(a) Dirichlet's theorem says that "any" function f(x) on the interval [—«, +a] 
can be expanded as a Fourier series: 


fox Yo, sin(nz x/a) + b, cos(nz x/a)]. 


n=0 


Show that this can be written equivalently as 


o0 
f(x) = > c, einnx/a. 


n=—Co 
What is c,, in terms of a, and b,? 
(b) Show (by appropriate modification of Fourier’s trick) that 
| eas TM 
Ch = val faena dx. 
2a J—a 


(c) Eliminate n and c, in favor of the new variables k = (nz/a) and F(k) = 
/ 2/7 ac,. Show that (a) and (b) now become 
MES ila De i ik 
— F(k)e'"" Ak; |. F(k) = — fooe !" dx, 
V2 DM A 2X J-a 


where Ak is the increment in k from one n to the next. 


fx)= 
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(d) Take the limit a — oo to obtain Plancherel’s theorem. Comment: In view 
of their quite different origins, it is surprising (and delightful) that the two 
formulas—one for F(k) in terms of f(x), the other for f(x) in terms of 
F (k)—have such a similar structure in the limit a — oo. 


Problem 2.21 A free particle has the initial wave function 
V(x.0) = Ae@ I, 
where A and a are positive real constants. 
(a) Normalize V(x. 0). 
(b) Find $ (X). 
(c) Construct V (x, f), in the form of an integral. 


(d) Discuss the limiting cases (a very large, and a very small). 


xProblem 2.22 The gaussian wave packet. A free particle has the initial wave 
function 


W(x, 0) = Ae, 
where A and a are constants (a is real and positive). 
(a) Normalize V (x. 0). 
(b) Find W(x, f). Hint: Integrals of the form 


Toc A 
J eT (ax +bx) dx 
-0 


can be handled by “completing the square”: Let y = Va [x + (b/2a)], and 
note that (ax? + bx) = y* — (b^/4a). Answer: 


V (x. f) ey eax? Al Qihat /m)] 
X. = | — ee a ee 
i 1 + (2ihat/m) 


(c) Find |W(x, t)/?. Express your answer in terms of the quantity 


ect een 
1 + (2hat/m)? 


Sketch |W|? (as a function of x) at t = 0, and again for some very large t. 
Qualitatively, what happens to ||”, as time goes on? 

(d) Find (x), (p), (x7), (p?). ox, and op. Partial answer: (p) = ah?*, but it 
may take some algebra to reduce it to this simple form. 


(e) Does the uncertainty principle hold? At what time t does the system come 
closest to the uncertainty limit? 
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2.5 THE DELTA-FUNCTION POTENTIAL 


2.5.1 Bound States and Scattering States 


We have encountered two very different kinds of solutions to the time-independent 
Schrédinger equation: For the infinite square well and the harmonic oscillator they 
are normalizable, and labeled by a discrete index n; for the free particle they are 
non-normalizable, and labeled by a continuous variable k. The former represent 
physically realizable states in their own right, the latter do not; but in both cases the 
general solution to the time-dependent Schródinger equation is a linear combination 
of stationary states—for the first type this combination takes the form of a sum 
(over n), whereas for the second it is an integral (over k). What is the physical 
significance of this distinction? 

In classical mechanics a one-dimensional time-independent potential can give 
rise to two rather different kinds of motion. If V (x) rises higher than the particle's 
total energy (E) on either side (Figure 2.12(a)), then the particle is "stuck" in the 
potential well—it rocks back and forth between the turning points, but it cannot 
escape (unless, of course, you provide it with a source of extra energy, such as 
a motor, but we're not talking about that). We call this a bound state. If, on the 
other hand, E exceeds V (x) on one side (or both), then the particle comes in from 
"infinity," slows down or speeds up under the influence of the potential, and returns 
to infinity (Figure 2.12(b)). (It can't get trapped in the potential unless there is some 
mechanism, such as friction, to dissipate energy, but again, we're not talking about 
that.) We call this a scattering state. Some potentials admit only bound states (for 
instance, the harmonic oscillator); some allow only scattering states (a potential 
hill with no dips in it, for example); some permit both kinds, depending on the 
energy of the particle. 

The two kinds of solutions to the Schródinger equation correspond precisely to 
bound and scattering states. The distinction is even cleaner in the quantum domain, 
because the phenomenon of tunneling (which we'll come to shortly) allows the 
particle to “leak” through any finite potential barrier, so the only thing that matters 
is the potential at infinity (Figure 2.12(c)): 


E-»|[V(-oo) or V(+00)]= scattering state. [2.109] 


| E «[V(—oo) and V(4oo)] 2 bound state, 
In "real life" most potentials go to zero at infinity, in which case the criterion 
simplifies even further: 


| E «0-2 bound state, [2.110] 


E >0= scattering state. 


Because the infinite square well and harmonic oscillator potentials go to infinity as 
x — too, they admit bound states only; because the free particle potential is zero 
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Classical turning points 


(a) 


— 


Classical turning point 


V(x) 


Classical turning points 


(c) 


FIGURE 2.12: (a) A bound state. (b) Scattering states. (c) A classical bound state, but 
a quantum scattering state. 


everywhere, it only allows scattering states.?^ In this section (and the following 
one) we shall explore potentials that give rise to both kinds of states. 


Sy you are irritatingly observant. you may have noticed that the general theorem requiring 
E > Vmin (Problem 2.2) does not really apply to scattering states. since they are not normalizable 
any way. If this bothers you, try solving the Schródinger equation with E x 0. for the free particle, and 
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S(x) 


FIGURE 2.13: The Dirac delta function 
x (Equation 2.111). 


2.5.2 The Delta-Function Well 


The Dirac delta function is an infinitely high, infinitesimally narrow spike at the 
origin, whose area is 1 (Figure 2.13): 


,_f 0 ifx 40 . iac D 
es | edo apes i with J ó(x)dx =l. [2.111] 


Technically, it isn't a function at all, since it is not finite at x = 0 (mathematicians 
call it a generalized function, or distribution).*> Nevertheless, it is an extremely 
useful construct in theoretical physics. (For example, in electrodynamics the charge 
density of a point charge is a delta function.) Notice that 5(x — a) would be a spike 
of area 1 at the point a. If you multiply 5(x — a) by an ordinary function f (x), 
it’s the same as multiplying by f(a), 


FO — a) 2 f(a)d(x — a), [2.112] 


because the product is zero anyway except at the point a. In particular, 


+00 


+00 
fœ) — a)dx = fa) f ó(x —a)dx = f(a). [2.113] 


—9oQ0 


That's the most important property of the delta function: Under the integral sign it 
serves to "pick out" the value of f(x) at the point a. (Of course, the integral need 
not go from —oo to +00; all that matters is that the domain of integration include 
the point a, so a — € to a + € would do, for any € > 0.) 

Let's consider a potential of the form 


V(x) = —aó(x). [2.114] 


note that even linear combinations of these solutions cannot be normalized. The positive energy solutions 
by themselves constitute a complete set. 


35The delta function can be thought of as the [mir of a sequence of functions, such as rectangles 
(or triangles) of ever-increasing height and ever-decreasing width. 
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where œ is some positive constant.?Ó This is an artificial potential, to be sure (so was 
the infinite square well), but it’s delightfully simple to work with, and illuminates 
the basic theory with a minimum of analytical clutter. The Schrédinger equation 
for the delta-function well reads 

he d'y 

—-——> —aó(x)y = Ey; 2.115 

zm g? T COY = EW: [2.115] 

it yields both bound states (E < 0) and scattering states (E > 0). 
We'll look first at the bound states. In the region x < 0, V(x) = 0, so 


d^ |, 2mE 


m x — y, [2.116] 
where 
/—2mE 
C= T. [2.117] 
1 


(E is negative, by assumption, so « is real and positive.) The general solution to 
Equation 2.116 is 
w(x) = Ae ** + Be", [2.118] 


but the first term blows up as x — —oo, so we must choose A = 0: 
w(x) = Be". (x <0). [2.119] 


In the region x > 0, V(x) is again zero, and the general solution is of the form 
F exp(—«x) + G exp(x«x); this time it’s the second term that blows up (as x > 
+20), so 
V(x)-2Fe". (x»0). [2.120] 
It remains only to stitch these two functions together, using the appropriate 
boundary conditions at x = 0. I quoted earlier the standard boundary conditions 
for y: 


| l. y is always continuous; [2.121] 


2.dw/dx is continuous except at points where the potential is infinite. 


In this case the first boundary condition tells us that F — B, so 


Be**, (x <0). 


vo) = | He. eo. [2.122] 


"CThe delta function itself carries units of length (see Equation 2.111). so œ has the dimensions 
enerey X length. 
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FIGURE 2.14: Bound state wave function for the delta-function potential (Equa- 
tion 2.122). 


w(x) is plotted in Figure 2.14. The second boundary condition tells us nothing; 
this is (like the infinite square well) the exceptional case where V is infinite at the 
join, and it’s clear from the graph that this function has a kink at x = 0. Moreover, 
up to this point the delta function has not come into the story at all. Evidently the 
delta function must determine the discontinuity in the derivative of y, at x = 0. 
PH show you now how this works, and as a by-product we'll see why dy/dx is 
ordinarily continuous. 

The idea is to integrate the Schrödinger equation, from —e to +€, and then 
take the limit as € — 0: 


h2 pte J2 +e +e 
i LH d dx «f Vix)Ww(x)dx = E w(x) dx. [2.123] 


2m J-e dx? = I 


The first integral is nothing but dy/dx, evaluated at the two end points; the last 
integral is zero, in the limit € — 0, since it's the area of a sliver with vanishing 
width and finite height. Thus 


du\ . (dw 
ZSA] dug 
S ( dx ) cU ( dx 


Typically, the limit on the right is again zero, and that’s why dy/dx is ordinarily 
continuous. But when V(x) is infinite at the boundary, this argument fails. In 
particular, if V(x) = —oó(x), Equation 2.113 yields 


^ (=) e Gs, [2.125] 


af 


te dx 


+e 
) = n im f V()v(x)dx. [2.124] 


dx h? 
For the case at hand (Equation 2.122), 


dy/dx — —Bke **, for (x >0). so dw/dx|, = —Bx, 
dy/dx —--Bke***, for (x <0), sody/dx| =+Bk. 


and hence A(dy/dx) = —2Bx. And y (0) = B. So Equation 2.125 says 


ma 


Tu [2.126] 
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and the allowed energy (Equation 2.117) is 


E h? ma? 127 
~~ Om OR i 


Finally, we normalize y: 


+00 2 
[. we? ax =B [^ e dy = Es = 


so (choosing, for convenience, the positive real root): 


M. 


B — yk — [2.128] 


Evidently the delta-function well, regardless of its "strength" o, has exactly one 
bound state: 


vma g nelst/m. 
$ 
h 


w(x) = f=. [2.129] 


What about scattering states, with E > 0? For x < 0 the Schrédinger equation 
reads 


2 
ote _2mE eye EN ip 
where 
k= p [2.130] 
is real and positive. The general solution is 
V (x) = Ae + Be ™, [2.131] 


and this time we cannot rule out either term, since neither of them blows up. 
Similarly, for x > 0, l l 
w(x) = Fe + Ge, [2.132] 


The continuity of w(x) at x = 0 requires that 
F+G=A+B. [2.133] 
The derivatives are 


dy/dx — ik(Fe'* — Ge"). for (x 20), sody/dx|, =ik(F — G), 
dy/dx — ik (Ae — Be), for (x <0), sody/dx|. — ik(A — B). 
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and hence A(dy//dx) = ik(F — G — A + B). Meanwhile, y/ (0) = (A+ B), so the 
second boundary condition (Equation 2.125) says 
2ma 


ik(P -G-A* B) 75 
2 


(A + B). [2.134] 


or, more compactly, 


F—G-A( + 2i8) - B(1 — 2iB). where B = v [2.135] 
i 
Having imposed both boundary conditions, we are left with two equations 
(Equations 2.133 and 2.135) in four unknowns (A, B, F, and G)—five, if you 
count k. Normalization won't help—this isn't a normalizable state. Perhaps we'd 
better pause, then, and examine the physical significance of these various con- 
stants. Recall that exp(ikx) gives rise (when coupled with the time-dependent 
factor exp(—i Et/h)) to a wave function propagating to the right, and exp(—ikx) 
leads to a wave propagating to the left. It follows that A (in Equation 2.131) is the 
amplitude of a wave coming in from the left, B is the amplitude of a wave return- 
ing to the left, F (Equation 2.132) is the amplitude of a wave traveling off to the 
right, and G is the amplitude of a wave coming in from the right (see Figure 2.15). 
In a typical scattering experiment particles are fired in from one direction— let's 
say, from the left. In that case the amplitude of the wave coming in from the right 
will be zero: 
G — 0. (for scattering from the left); [2.136] 


A is the amplitude of the incident wave, B is the amplitude of the reflected wave, 
and F is the amplitude of the transmitted wave. Solving Equations 2.133 and 
2.135 for B and F, we find 


] l 
LP 


B= jo uL A. 
1 — ig [^ 


[2:157] 


(If you want to study scattering from the right, set A = 0; then G is the incident 
amplitude, F is the reflected amplitude, and B is the transmitted amplitude.) 


FIGURE 2.15: Scattering from a delta func- 
tion well. 
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Now, the probability of finding the particle at a specified location is given by 

|V |?, so the relative?" probability that an incident particle will be reflected back is 
-BE E 

JAP EL 


[2.138] 


R is called the reflection coefficient. (If you have a beam of particles, it tells 
you the fraction of the incoming number that will bounce back.) Meanwhile, the 
probability of transmission is given by the transmission coefficient 


F|? l 
= | | = ——_., [2.139] 
[AP 1+? 
Of course, the sum of these probabilities should be 1—and it is: 
R+T =]. [2.140] 


Notice that R and T are functions of f, and hence (Equations 2.130 and 2.135) 
of E: 


l 
: [2.141] 


-LÁÉ——-—,.T-2—————. 
1+ QI E/ma?): 1+ (na? /2h? E) 


The higher the energy, the greater the probability of transmission (which certainly 
seems reasonable). 

This is all very tidy, but there is a sticky matter of principle that we cannot 
altogether ignore: These scattering wave functions are not normalizable, so they 
don't actually represent possible particle states. But we know what the resolution to 
this problem is: We must form normalizable linear combinations of the stationary 
states, just as we did for the free particle— true physical particles are represented 
by the resulting wave packets. Though straightforward in principle, this is a messy 
business in practice, and at this point it is best to turn the problem over to a 
computer.?? Meanwhile, since it is impossible to create a normalizable free-particle 
wave function without involving a range of energies, R and T should be interpreted 
as the approximate reflection and transmission probabilities for particles in the 
vicinity of E. 

Incidentally, it might strike you as peculiar that we were able to analyze a 
quintessentially time-dependent problem (particle comes in, scatters off a potential, 


"This is not a normalizable wave function. so the absolute probability of finding the particle 
at a particular location is not well defined; nevertheless. the ratio of probabilities for the incident and 
reflected waves is meaningful. More on this in the next paragraph. 

38 Numerical studies of wave packets scattering off wells and barriers reveal extraordinarily rich 
structure. The classic analysis is A. Goldberg, H. M. Schey. and J. L. Schwartz, Am. J. Phys. 35, 177 
(1967); more recent work can be found on the Web. 
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V(x) = 0 8(x) 


x FIGURE 2.16: The delta-function barrier. 


and flies off to infinity) using stationary states. After all, w (in Equations 2.131 
and 2.132) is simply a complex, time-independent, sinusoidal function, extending 
(with constant amplitude) to infinity in both directions. And yet, by imposing 
appropriate boundary conditions on this function we were able to determine the 
probability that a particle (represented by a localized wave packet) would bounce 
off, or pass through, the potential. The mathematical miracle behind this is, I 
suppose, the fact that by taking linear combinations of states spread over all space, 
and with essentially trivial time dependence, we can construct wave functions that 
are concentrated about a (moving) point, with quite elaborate behavior in time (see 
Problem 2.43). 

As long as we've got the relevant equations on the table, let's look briefly at 
the case of a delta-function barrier (Figure 2.16). Formally, all we have to do is 
change the sign of o. This kills the bound state, of course (Problem 2.2). On the 
other hand, the reflection and transmission coefficients, which depend only on o", 
are unchanged. Strange to say, the particle is just as likely to pass through the barrier 
as to cross over the well! Classically, of course, a particle cannot make it over an 
infinitely high barrier, regardless of its energy. In fact, classical scattering problems 
are pretty dull: If E > Vmax, then T = 1 and R = 0—the particle certainly 
makes it over; if E < Vmax then T = 0 and R = 1—it rides up the hill until 
it runs out of steam, and then returns the same way it came. Quantum scattering 
problems are much richer: The particle has some nonzero probability of passing 
through the potential even if E < Vmax. We call this phenomenon tunneling; it is 
the mechanism that makes possible much of modern electronics — not to mention 
spectacular advances in microscopy. Conversely, even if E > Vmax there is a 
possibility that the particle will bounce back— though I wouldn't advise driving 
off a cliff in the hope that quantum mechanics will save you (see Problem 2.35). 


xProblem 2.23 Evaluate the following integrals: 


(a) f * G3? —3x? + 2x — D&(x + 2) dx. 
(b) Jy [cos(3x) + 2]d(x — x) dx. 
(c). m exp(|x| + 3ó(x — 2) dx. 
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Problem 2.24 Delta functions live under integral signs, and two expressions (Dj (x) 
and D2(x)) involving delta functions are said to be equal if 


+00 Too 


f)Di(x)dx = f (x) D2(x) dx. 


—00 =% 
for every (ordinary) function f(x). 
(a) Show that 


ó(cx) = O] [2.142] 


where c is a real constant. (Be sure to check the case where c is negative.) 


(b) Let 0(x) be the step function: 


l, dx 


0. dee. [2.143] 


0(x)z | 
(In the rare case where it actually matters, we define 8 (0) to be 1/2.) Show 
that dð /dx = ô(x). 


x *Problem 2.25 Check the uncertainty principle for the wave function in 
Equation 2.129. Hint: Calculating (p?) is tricky, because the derivative of y has 
a step discontinuity at x = 0. Use the result in Problem 2.24(b). Partial answer: 


(p?) = (mo/ny.. 


*Problem 2.26 What is the Fourier transform of 8 (x)? Using Plancherel’s theorem, 
show that 
l Too X 
8(x) = — J e™ dk. [2.144] 
ZA Joo 
Comment: This formula gives any respectable mathematician apoplexy. Although 
the integral is clearly infinite when x = 0, it doesn’t converge (to zero or any- 
thing else) when x # 0, since the integrand oscillates forever. There are ways 
to patch it up (for instance, you can integrate from —L to +L, and interpret 
Equation 2.144 to mean the average value of the finite integral, as L — oo). 
The source of the problem is that the delta function doesn't meet the requirement 
(square-integrability) for Plancherel's theorem (see footnote 33). In spite of this, 
Equation 2.144 can be extremely useful, if handled with care. 


xProblem 2.27 Consider the double delta-function potential 
V(x) 2 —a[8(x + a) + d(x — a)]. 


where o and a are positive constants. 
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(a) Sketch this potential. 


(b) How many bound states does it possess? Find the allowed energies, for a = 
h?/ ma and for œ = ħ?/4ma, and sketch the wave functions. 


x xProblem 2.28 Find the transmission coefficient for the potential in Problem 2.27. 


2.6 THE FINITE SQUARE WELL 


As a last example, consider the finite square well potential 


A] ^W. for -a<x <a, 


where Vo is a (positive) constant (Figure 2.17). Like the delta-function well, this 
potential admits both bound states (with E < 0) and scattering states (with E > 0). 
We'll look first at the bound states. 

In the region x « —a the potential is zero, so the Schródinger equation reads 


h dey Py» 
= = Ev. = ky, 
2m dx? Ve D dx? did 
where 
ga E [2.146] 


is real and positive. The general solution is y (x) = A exp(—«x) + B exp(«x), but 
the first term blows up (as x — —oo), so the physically admissible solution (as 
before—see Equation 2.119) is 


w(x) = Be", forx « —a. [2.147] 


FIGURE 2.17: The finite square well 
(Equation 2.145). 
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In the region —a < x < a, V(x) = —Vo, and the Schrödinger equation reads 
ne dy d a 
—-—— — Voy = Ey. --—l^y, 
2m dx? ov ye ot dx? y 
where 
J/2m(E + V 
l= one [2.148] 


Although E is negative, for bound states, it must be greater than — Vo, by the 
old theorem E > Vmin (Problem 2.2); so / is also real and positive. The general 
solution is?? 


w(x) = C sin(lx) + D cos(lix), for -a<x <a, [2.149] 


where C and D are arbitrary constants. Finally, in the region x > a the potential 
is again zero; the general solution is w(x) = F exp(—«x) + G exp(kx), but the 
second term blows up (as x — oo), so we are left with 


y(x) = Fe™™. forx»a. [2.150] 


The next step is to impose boundary conditions: y and dy/dx continuous at 
—a and +a. But we can save a little time by noting that this potential is an even 
function, so we can assume with no loss of generality that the solutions are either 
even or odd (Problem 2.1(c)). The advantage of this is that we need only impose 
the boundary conditions on one side (say, at +a); the other side is then automatic, 
since w(—x) = y(x). I'll work out the even solutions; you get to do the odd 
ones in Problem 2.29. The cosine is even (and the sine is odd), so I’m looking for 
solutions of the form 


Fe, for x >a. 
V(x)—4|1 Dcos(ix). for0«x «a. [2.151] 
y (—x). for x « 0. 


The continuity of y (x), at x =a, says 
Fe ** = Dcos(la). [2.152] 
and the continuity of dy/dx, says 
—kFe ** = —[Dsin(la). [2.153] 
Dividing Equation 2.153 by Equation 2.152, we find that 
k — ltan(la). [2.154] 


39 You can. if you like, write the general solution in exponential form (C'ei* + D'e7!*). This 
leads to the same final result. but since the potential is symmetric we know the solutions will be either 
even or odd. and the sine/cosine notation allows us to exploit this directly. 
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V (zo/z)*-1 


n/2 n 3n/2 Qn 57/2 Zz 
FIGURE 2.18: Graphical solution to Equation 2.156, for zo = 8 (even states). 


This is a formula for the allowed energies, since x and / are both functions 
of E. To solve for E, we first adopt some nicer notation: Let 


z=la, and zo = $ V 2m Vo. [2.155] 
1 


According to Equations 2.146 and 2.148, (x? + |?) = 2m Vo/ħ?, so ka = NET =z}, ` 


and Equation 2.154 reads 
tanz = V (zo/z)? — 1. [2.156] 


This is a transcendental equation for z (and hence for E) as a function of zo 
(which is a measure of the “size” of the well). It can be solved numerically, using 
a computer, or graphically, by plotting tanz and ./(zo/z)? — 1 on the same grid, 
and looking for points of intersection (see Figure 2.18). Two limiting cases are of 
special interest: 

1. Wide, deep well. If zo is very large, the intersections occur just slightly 
below z, = nz/2, with n odd; it follows that 

"ES n?n h? l 

E, + Vo = may [2.157] 
But E + Vo is the energy above the bottom of the well, and on the right side 
we have precisely the infinite square well energies, for a well of width 2a (see 
Equation 2.27)—or rather, half of them, since this n is odd. (The other ones, of 
course, come from the odd wave functions, as you'll discover in Problem 2.29.) So 
the finite square well goes over to the infinite square well, as Vy — oo; however, 
for any finite Vo there are only a finite number of bound states. 

2. Shallow, narrow well. As zo decreases, there are fewer and fewer bound 
states, until finally (for zo < 2/2, where the lowest odd state disappears) only one 
remains. It is interesting to note, however, that there is always one bound state, no 
matter how “weak” the well becomes. 
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You're welcome to normalize y (Equation 2.151), if you're interested 
(Problem 2.30), but I'm going to move on now to the scattering states (E > 0). 
To the left, where V(x) = 0, we have 


y (x) = Ae + Be, for (x < —a). [2.158] 
where (as usual) 
J2m E 
p= = [2.159] 


Inside the well, where V(x) = —Vo, 
w(x) = C sin(lx) + D cos(lx), for (-a <x <a), [2.160] 


where, as before, 


i= one [2.161] 
1 


To the right, assuming there is no incoming wave in this region, we have 
w(x) = Fel". [2.162] 
Here À is the incident amplitude, B is the reflected amplitude, and F is the trans- 
mitted amplitude. ^? 
There are four boundary conditions: Continuity of y (x) at —a says 
Ae !*^ + Be? = —C sin(la) + D cos(la). [2.163] 
continuity of dy/dx at —a gives 
ik{Aei*¢ — Be] = I(C cos(la) + D sin(la)] [2.164] 
continuity of y(x) at +a yields 
C sin(la) + D cos(la) = Fe, [2.165] 
and continuity of dy/dx at +a requires 


I[C cos(la) — D sin(la)] = ik Fe“. [2.166] 


We could look for even and odd functions. as we did in the case of bound states. but the 
scattering problem is inherently asymmetric, since the waves come in from one side only, and the 
exponential notation (representing traveling waves) is more natural in this context. 
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FIGURE 2.19: Transmission coefficient as a function of energy (Equation 2.169). 


We can use two of these to eliminate C and D, and solve the remaining two for 
B and F (see Problem 2.32): 


.sin(2/a) 5 A 
= | ——— (l4 — k*) F, . 
B=i 2 ( k^) [2.167] 
—2ika 
Ee. ies [2.168] 


cos(2la) — i EHP sin(2la) 


The transmission coefficient (T = |F|?/|A|*), expressed in terms of the orig- 
inal variables, is given by 
2 


V- 
T^ 214 — sin? (Fv ame + vo) À [2.169] 


4E(E+Vo) h 
Notice that T = | (the well becomes "transparent") whenever the sine is zero, 


which is to say, when 
2 
Ey 2m(En + Vo) — nz. [2.170] 
1 


where n is any integer. The energies for perfect transmission, then, are given by 
noah? 
2m (2a)? 
which happen to be precisely the allowed energies for the infinite square well. T 
is plotted in Figure 2.19, as a function of energy.*! 


E, + Vo = [2.171] 


xProblem 2.29 Analyze the odd bound state wave functions for the finite square 


well. Derive the transcendental equation for the allowed energies, and solve it 
graphically. Examine the two limiting cases. Is there always an odd bound state? 


+! This remarkable phenomenon has been observed in the laboratory, in the form of the Ramsauer- 
Townsend effect. For an illuminating discussion see Richard W. Robinett. Quantum Mechanics, Oxford 
U.P.. 1997. Section 12.4.1. 
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Problem 2.30 Normalize y(x) in Equation 2.151, to determine the constants D 
and F. 


Problem 2.31 The Dirac delta function can be thought of as the limiting case of a 
rectangle of area 1, as the height goes to infinity and the width goes to zero. Show 
that the delta-function well (Equation 2.114) is a “weak” potential (even though it 
is infinitely deep), in the sense that zy — 0. Determine the bound state energy for 
the delta-function potential, by treating it as the limit of a finite square well. Check 
that your answer is consistent with Equation 2.129. Also show that Equation 2.169 
reduces to Equation 2.141 in the appropriate limit. 


Problem 2.32 Derive Equations 2.167 and 2.168. Hint: Use Equations 2.165 and 
2.166 to solve for C and D in terms of F: 


k " k " 
C= sin a) + i7 cos(a) e"F D= [costi — [y siad) el F, 


Plug these back into Equations 2.163 and 2.164. Obtain the transmission coefficient, 
and confirm Equation 2.169. 


x xProblem 2.33 Determine the transmission coefficient for a rectangular barrier 
(same as Equation 2.145, only with V(x) = +Vo > 0 in the region —a < x < a). 
Treat separately the three cases E < Vo, E = Vo, and E > Vp (note that the 
wave function inside the barrier is different in the three cases). Partial answer: For 
EV 


V2 2 
T = 1 + ——9—— sinh? (Svan = E) 


si 
4E(Vo — E) h 


xProblem 2.34 Consider the “step” potential: 


0. ifx <0. 
Vx) = | Vo. ifx >00. 


(a) Calculate the reflection coefficient, for the case E < Vo, and comment on 
the answer. 


(b) Calculate the reflection coefficient for the case E > Vo. 


(c) For a potential such as this, which does not go back to zero to the right of 
the barrier, the transmission coefficient is not simply |F|?/|A|? (with A the 


J?This is a good example of tunneling— classically the particle would bounce back. 
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V(x) 


FIGURE 2.20: Scattering from a "cliff" (Problem 2.35). 


(d) 


incident amplitude and F the transmitted amplitude), because the transmitted 
wave travels at a different speed. Show that 
E — Vo |F]? 
T= o | B 
E  |Al* 
for E > Vo. Hint: You can figure it out using Equation 2.98, or— more ele- 


gantly, but less informatively —from the probability current (Problem 2.19). 
What is T, for E « Vo? 


[2.172] 


For E > Vo, calculate the transmission coefficient for the step potential, and 
check that T + R — 1. 


Problem 2.35 A particle of mass m and kinetic energy E > 0 approaches an 
abrupt potential drop Vo (Figure 2.20). 


(a) 


(b) 


(c) 


What is the probability that it will "reflect" back, if E = Vo/3? Hint: This 
is just like Problem 2.34, except that the step now goes down, instead of up. 


I drew the figure so as to make you think of a car approaching a cliff, but 
obviously the probability of "bouncing back" from the edge of a cliff is far 
smaller than what you got in (a)—unless you're Bugs Bunny. Explain why 
this potential does not correctly represent a cliff. Hint: In Figure 2.20 the 
potential energy of the car drops discontinuously to — Vo, as it passes x — 0; 
would this be true for a falling car? 


When a free neutron enters a nucleus, it experiences a sudden drop in poten- 
tial energy. from V — 0 outside to around —12 MeV (million electron volts) 
inside. Suppose a neutron, emitted with kinetic energy 4 MeV by a fission 
event, strikes such a nucleus. What is the probability it will be absorbed, 
thereby initiating another fission? Hint: You calculated the probability of 
reflection in part (a); use T — 1 — R to get the probability of transmission 
through the surface. 


Further Problems for Chapter 2 85 


FURTHER PROBLEMS FOR CHAPTER 2 


Problem 2.36 Solve the time-independent Schrédinger equation with appropri- 
ate boundary conditions for the "centered" infinite square well: V(x) = O (for 
—a < x < +a), V(x) = oo (otherwise). Check that your allowed energies are 
consistent with mine (Equation 2.27), and confirm that your y/'s can be obtained 
from mine (Equation 2.28) by the substitution x — (x + a)/2 (and appropriate 
renormalization). Sketch your first three solutions, and compare Figure 2.2. Note 
that the width of the well is now 2a. 


Problem 2.37 A particle in the infinite square well (Equation 2.19) has the initial 
wave function 


V(x.0)—A sin? (rx /a) (0 € x <a). 


Determine A, find V(x.7), and calculate (x), as a function of time. What is the 
expectation value of the energy? Hirt: sin" 0 and cos" 0 can be reduced, by repeated 
application of the trigonometric sum formulas, to linear combinations of sin(m6) 
and cos(710), with m — 0.1.2. ....n. 


xProblem 2.38 A particle of mass m is in the ground state of the infinite square well 
(Equation 2.19). Suddenly the well expands to twice its original size—the right 
wall moving from a to 2a —leaving the wave function (momentarily) undisturbed. 
The energy of the particle is now measured. 


(a) What is the most probable result? What is the probability of getting that 
result? 


(b) What is the next most probable result, and what is its probability? 


(c) What is the expectation value of the energy? Hint: If you find yourself 
confronted with an infinite series, try another method. 


Problem 2.39 


(a) Show that the wave function of a particle in the infinite square well returns 
to its original form after a quantum revival time T = 4ma?/zh. That is: 
V (x. T) = V(x.0) for any state (ror just a stationary state). 


(b) What is the classical revival time, for a particle of energy E bouncing back 
and forth between the walls? 


(c) For what energy are the two revival times equal?*? 


*3The fact that the classical and quantum revival times bear no obvious relation to one another 
(and the quantum one doesn't even depend on the energy) is a curious paradox: see Daniel Styer. 
Am. J. Phys. 69, 56 (2001). 
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Problem 2.40 A particle of mass m is in the potential 


oo (x « 0), 
Vix) = { —32ħ?/ma? (0xx <a). 
0 (x >a). 


(a) How many bound states are there? 


(b) In the highest-energy bound state, what is the probability that the particle 
would be found outside the well (x > a)? Answer: 0.542, so even though it 
is “bound” by the well, it is more likely to be found outside than inside! 


Problem 2.41 A particle of mass m in the harmonic oscillator potential 
(Equation 2.43) starts out in the state 


for some constant A. 


(a) What is the expectation value of the energy? 


(b) At some later time T the wave function is 


2 
Wix,T)=B (1 +2 Ez) eg WA 


for some constant B. What is the smallest possible value of 7? 


Problem 2.42 Find the allowed energies of the half harmonic oscillator 


(1/2)mo* x?. for x > 0. 
co. for x « 0. 


V(x) = | 


(This represents, for example. a spring that can be stretched, but not compressed.) 
Hint: This requires some careful thought, but very little actual computation. 


x xProblem 2.43 In Problem 2.22 you analyzed the stationary gaussian free particle 
wave packet. Now solve the same problem for the traveling gaussian wave packet, 
starting with the initial wave function 


Vx, 0) = Aee e'l, 


where / is a real constant. 
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x «Problem 2.44 Solve the time-independent Schrödinger equation for a centered 
infinite square well with a delta-function barrier in the middle: 


ad(x). for —a <x < +4, 
co, for |x| > a. 


V(x) - 


Treat the even and odd wave functions separately. Don't bother to normalize them. 
Find the allowed energies (graphically, if necessary). How do they compare with 
the corresponding energies in the absence of the delta function? Explain why the 
odd solutions are not affected by the delta function. Comment on the limiting cases 
a — 0 and a > co. 


Problem 2.45 If two (or more) distinct solutions to the (time-independent) 
Schródinger equation have the same energy E, these states are said to be degen- 
erate. For example, the free particle states are doubly degenerate—one solution 
representing motion to the right. and the other motion to the left. But we have never 
encountered normalizable degenerate solutions, and this is no accident. Prove the 
following theorem: In one dimension? there are no degenerate bound states. Hint: 
Suppose there are two solutions, y; and y», with the same energy E. Multiply the 
Schrödinger equation for y, by y2, and the Schrödinger equation for y? by y1, 
and subtract, to show that (yodyi/dx — yidvya/dx) is a constant. Use the fact 
that for normalizable solutions yy — 0 at too to demonstrate that this constant is 
in fact zero. Conclude that y is a multiple of yı, and hence that the two solutions 
are not distinct. 


Problem 2.46 Imagine a bead of mass m that slides frictionlessly around a circular 
wire ring of circumference L. (This is just like a free particle, except that w(x + 
L) = vy (x).) Find the stationary states (with appropriate normalization) and the 
corresponding allowed energies. Note that there are two independent solutions for 
each energy E, — corresponding to clockwise and counter-clockwise circulation; 
call them V (x) and y7 (x). How do you account for this degeneracy, in view of 
the theorem in Problem 2.45 (why does the theorem fail, in this case)? 


x xProblem 2.47 Attention: This is a strictly qualitative problem—no calculations 
allowed! Consider the “double square well" potential (Figure 2.21). Suppose the 


+41f two solutions differ only by a multiplicative constant (so that, once normalized. they differ 
only by a phase factor el), they represent the same physical state. and in this sense they are nor distinct 
solutions. Technically, by “distinct” I mean “linearly independent.” 


In higher dimensions such degeneracy is very common. as we shall see in Chapter 4. Assume 
that the potential does not consist of isolated pieces separated by regions where V = oo— two isolated 
infinite square wells. for instance, would give rise to degenerate bound states. for which the particle is 
cither in the one or in the other. 
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FIGURE 2.21: The double square well (Problem 2.47). 


depth Vo and the width a are fixed, and large enough so that several bound states 
occur. 


(a) Sketch the ground state wave function y and the first excited state yo, 
(i) for the case b = 0, (ii) for b © a, and (iii) for b > a. 


(b) Qualitatively, how do the corresponding energies (E and E?) vary, as b goes 
from 0 to co? Sketch Ej(b) and E2(b) on the same graph. 


(c) The double well is a very primitive one-dimensional model for the potential 
experienced by an electron in a diatomic molecule (the two wells represent 
the attractive force of the nuclei). If the nuclei are free to move, they will 
adopt the configuration of minimum energy. In view of your conclusions in 
(b), does the electron tend to draw the nuclei together, or push them apart? 
(Of course, there is also the internuclear repulsion to consider, but that's a 
separate problem.) 


Problem 2.48 In Problem 2.7(d) you got the expectation value of the energy by 
summing the series in Equation 2.39, but 1 warned you (in footnote 15) not to try it 
the “old fashioned way," (H) = f W(x. 0)* H W(x. 0) dx, because the discontinu- 
ous first derivative of Y (x. 0) renders the second derivative problematic. Actually, 
you could have done it using integration by parts, but the Dirac delta function 
affords a much cleaner way to handle such anomalies. 


(a) Calculate the first derivative of Y(x.0) (in Problem 2.7), and express the 
answer in terms of the step function, 0 (x — a/2), defined in Equation 2.143. 
(Don't worry about the end points — just the interior region 0 < x < a.) 


(b) Exploit the result of Problem 2.24(b) to write the second derivative of W(x, 0) 
in terms of the delta function. 


(c) Evaluate the integral f 9G.0)*H V (x, 0) dx, and check that you get the 
same answer as before. 
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x x «Problem 2.49 
(a) Show that 


mo mo fa, & ELT 
die (5) oak 3 (: aie Oo ee ee oe 220] 


satisfies the time-dependent Schródinger equation for the harmonic oscillator 
potential (Equation 2.43). Here a is any real constant with the dimensions of 
length.^$ 


(b) Find [V (x, t)|?, and describe the motion of the wave packet. 
(c) Compute (x) and (p), and check that Ehrenfest' s theorem (Equation 1.38) is 
satisfied. 
x xProblem 2.50 Consider the moving delta-function well: 
V(x, t) = —aó(x — vt), 
where v is the (constant) velocity of the well. 
(a) Show that the time-dependent Schródinger equation admits the exact solution 


W(x. t) = v T emery wilh? t-il UO /2)mv?yr mx] /h 
1 


where E — —ma?/2h? is the bound-state energy of the stationary delta 
function. Hint: Plug it in and check it! Use the result of Problem 2.24(b). 


(b) Find the expectation value of the Hamiltonian in this state, and comment on 
the result. 


x * xProblem 2.51 Consider the potential 


2.2 


aye 


2 
sech“ (ax), 
m 


V(x) =- 
where a is a positive constant, and "sech" stands for the hyperbolic secant. 
(a) Graph this potential. 
(b) Check that this potential has the ground state 
Vo(x) = A sech(ax). 
and find its energy. Normalize vo, and sketch its graph. 


4©This rare example of an exact closed-form solution to the time-dependent Schrödinger equation 
was discovered by Schródinger himself. in 1926. 


90 


Chapter 2 Time-Independent Schrödinger Equation 


(c) Show that the function 
ik —atanh(ax)\ 4, 
GQ) =A) A ikx 
Vix) ( ik+a ) diis 
(where k = 2i E/h, as usual) solves the Schrödinger equation for any 
(positive) energy E. Since tanh z —^ —1 as z + —oo, 


Wk(x) © Ae". for large negative x. 


This represents, then, a wave coming in from the left with no accompany- 
ing reflected wave (i.e., no term exp(—ikx)). What is the asymptotic form 
of y(x) at large positive x? What are R and T, for this potential? Com- 
ment: This is a famous example of a reflectionless potential — every incident 
particle, regardless of its energy, passes right through. . 


Problem 2.52 The scattering matrix. The theory of scattering generalizes in 
a pretty obvious way to arbitrary localized potentials (Figure 2.22). To the left 
(Region D, V (x) = 0, so 


J2mE 
= , [2.173] 
1 


y(x) = Ae* + Be~, where k = 


To the right (Region III), V(x) is again zero, so 
W(x) 2 Fe + Ge, [2.174] 
In between (Region ID, of course, I can't tell you what y is until you specify the 


potential, but because the Schródinger equation is a linear, second-order differential 
equation, the general solution has got to be of the form 


y (x) = Cf (x) + Dg(x). 


where f(x) and g(x) are two linearly independent particular solutions.*® There 
will be four boundary conditions (two joining Regions I and II, and two joining 


Aelkx V(x) Felkx 
———— ————ÀÓ 


Region | Region Il Region Ill 


FIGURE 2.22: Scattering from an arbitrary localized potential (V(x) = 0 except in 
Region II); Problem 2.52. 


YR, E. Crandall and B. R. Litt, Annals of Physics. 146. 458 (1983). 


BSce any book on differential equations — for example. J. L. Van Iwaarden. Ordinary Differential 
Equations with Numerical Techniques. Harcourt Brace Jovanovich, San Diego, 1985, Chapter 3. 
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Regions II and III). Two of these can be used to eliminate C and D, and the other 
two can be "solved" for B and F in terms of A and G: 


B=S,;A+S2G. F=S$213A+ 55G. 


The four coefficients 5;;, which depend on k (and hence on E), constitute a 2 x 2 
matrix S, called the scattering matrix (or S-matrix, for short). The S-matrix tells 
you the outgoing amplitudes (B and F) in terms of the incoming amplitudes (A 


and G): 
BY [Sun Si2\ (A 
(7) a n z) (6) [2.175] 


In the typical case of scattering from the left, G = 0, so the reflection and trans- 
mission coefficients are 


R; = BT eus de FE = Sn. [2.176] 

IAF lc=0 IAP Io 

For scattering from the right, A = 0, and 
ote) safe gall) nage [2.177] 
eS = il > ro TAD = il: : 

IGP yey IG? |a= 

(a) Construct the S-matrix for scattering from a delta-function well (Equa- 
tion 2.114), 


(b) Construct the S-matrix for the finite square well (Equation 2.145). Hint: This 
requires no new work, if you carefully exploit the symmetry of the problem. 


x x xProblem 2.53 The transfer matrix. The S-matrix (Problem 2.52) tells you the 
outgoing amplitudes (B and F) in terms of the incoming amplitudes (A and 
G)— Equation 2.175. For some purposes it is more convenient to work with the 
transfer matrix, M, which gives you the amplitudes to the right of the potential 
(F and G) in terms of those to the left (A and B): 


F = Mi: Mio A 
(&) zx a a (5) 3 [2.178] 


(a) Find the four elements of the M-matrix, in terms of the elements of the 
S-matrix, and vice versa. Express R;, 77, R,, and T; (Equations 2.176 and 
2.177) in terms of elements of the M-matrix. 


(b 


— 


Suppose you have a potential consisting of two isolated pieces (Figure 2.23). 
Show that the M-matrix for the combination is the product of the two 
M-matrices for each section separately: 


M = MM). [2.179] 


(This obviously generalizes to any number of pieces, and accounts for the 
usefulness of the M-matrix.) 
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M ow WT ee ee 


x 
—Ó — —ÓÁÀ—————áá 2 m 
V=0 V=0 V=0 


FIGURE 2.23: A potential consisting of two isolated pieces (Problem 2.53). 


(c) Construct the M-matrix for scattering from a single delta-function potential 
at point a: 


V(x) = —aó(x — a). 


(d) By the method of part (b), find the M-matrix for scattering from the double 
delta function 


V (x) = —al[ó(x +a) t-ó(x — a)]. 


What is the transmission coefficient for this potential? 


Problem 2.54 Find the ground state energy of the harmonic oscillator, to five sig- 
nificant digits, by the “wag-the-dog” method. That is, solve Equation 2.72 numer- 
ically, varying K until you get a wave function that goes to zero at large é. In 
Mathematica, appropriate input code would be 


Plot(Evaluate[u[x]/.NDSolve[(u"[x] -(x? - K)*u[x] == 0, u[0] == 1, 
u'[0] == 0), u[x], (x, 1073, 10), MaxSteps -> 10000]], (x, a, b], 
PlotRange -> (c, d]]; 
(Here (a. b) 1s the horizontal range of the graph, and (c, d) is the vertical range— 
start with a = 0, b = 10, c = —10, d = 10.) We know that the correct solution is 
K = 1, so you might start with a "guess" of K = 0.9. Notice what the “tail” of the 
wave function does. Now try K — 1.1, and note that the tail flips over. Somewhere 


in between those values lies the correct solution. Zero in on it by bracketing K ^ 


tighter and tighter. As you do so, you may want to adjust a, b, c, and d, to zero 
in on the cross-over point. 


Problem 2.55 Find the first three excited state energies (to five significant digits) 


for the harmonic oscillator, by wagging the dog (Problem 2.54). For the first (and 


third) excited state you will need to set u[0] == 0. 4'[0] == 1. 


Problem 2.56 Find the first four allowed energies (to five significant digits) for 
the infinite square well, by wagging the dog. Hint: Refer to Problem 2.54, making 
appropriate changes to the differential equation. This time the condition you are 
looking for is 4(1) = O. 


CHAPTER 3 


FORMALISM 


3.1 HILBERT SPACE 


In the last two chapters we have stumbled on a number of interesting properties of 
simple quantum systems. Some of these are “accidental” features of specific poten- 
tials (the even spacing of energy levels for the harmonic oscillator, for example), 
but others seem to be more general, and it would be nice to prove them once and 
for all (the uncertainty principle, for instance, and the orthogonality of stationary 
states). The purpose of this chapter is to recast the theory in a more powerful form, 
with that in mind. There is not much here that is genuinely new; the idea, rather, 
is to make coherent sense of what we have already discovered in particular cases. 

Quantum theory is based on two constructs: wave functions and operators. The 
state of a system is represented by its wave function, observables are represented 
by operators. Mathematically, wave functions satisfy the defining conditions for 
abstract vectors, and operators act on them as linear transformations. So the 
natural language of quantum mechanics is linear algebra.! 

But it is not, I suspect, a form of linear algebra with which you are immediately 
familiar. In an N-dimensional space it is simplest to represent a vector, |o), by the 
N-tuple of its components, (a,,), with respect to a specified orthonormal basis: 


al 
an 
la) > a=]. |. [3.1] 


an 


lr you have never studied linear algebra, you should read the Appendix before continuing. 
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The inner product, (o|8). of two vectors (generalizing the dot product in three 
dimensions) is the complex number, 


(a|B) = afb) -- a5b» +---+aybn. [3.2] 


Linear transformations, T, are represented by matrices (with respect to the specified 
basis), which act on vectors (to produce new vectors) by the ordinary rules of matrix 
multiplication: 


fi f2 co TIN al 
thi mho v bN a? 

IB) 2 Tl) >b=Ta=| . , |. [3.3] 
IN| İN2 c0 İNN aN 


But the "vectors" we encounter in quantum mechanics are (for the most part) 
functions, and they live in infinite-dimensional spaces. For them the N-tuple/matrix 
notation is awkward, at best, and manipulations that are well-behaved in the finite- 
dimensional case can be problematic. (The underlying reason is that whereas the 
finite sum in Equation 3.2 always exists, an infinite sum—or an integral —may not 
converge, in which case the inner product does not exist, and any argument involving 
inner products is immediately suspect.) So even though most of the terminology and 
notation should be familiar, it pays to approach this subject with caution. 

The collection of all functions of x constitutes a vector space, but for our 
purposes it is much too large. To represent a possible physical state, the wave 
function V must be normalized: 


f wes = |, 


The set of all square-integrable functions, on a specified interval, 


b 
f(x) such that | IEO? dx « oo. [3.4] 


a 
constitutes a (much smaller) vector space (see Problem 3.1(a)). Mathematicians 
call it L»(a. b); physicists call it Hilbert space. In quantum mechanics, then, 


Wave functions live in Hilbert space. [3.5] 


?For us. the limits (a and 5) will almost always be 3: oo. but we might as well keep things more 
general for the moment. 


"Technically. a Hilbert space is a complete inner product space, and the collection of square- 
integrable Tunctions is only one example of a Hilbert space—indeed, every finite-dimensional vector 
space is trivially a Hilbert space. But since L» is the arena of quantum mechanics, it's what physicists 
generally mean when they say "Hilbert spacc." By the way. the word complete here means that any 
Cauchy sequence of functions in Hilbert space converges to a function that is also in the space: it has no 
"holes" in it. just as the set of all real numbers has no holes (by contrast, the space of all polynomials, 
for example, like the set of all rational numbers. certainly does have holes in it). The completeness 
of a space has nothing to do with the completeness (same word. unfortunately) of a sert of functions, 
which is the property that any other function can be expressed as a linear combination of them. 
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We define the inner product of two functions, f(x) and g(x), as follows: 
(Fle) = f l Foyt dx. [3.6] 
If f and g are both square-integrable (that is, if they are both in Hilbert space), 


their inner product is guaranteed to exist (the integral in Equation 3.6 converges to 
a finite number).* This follows from the integral Schwarz inequality: 


b b 
f |f GO dx | podr: [3.7] 


a a 


b 
f fœ) g(x) dx| < 


You can check for yourself that Equation 3.6 satisfies all the conditions for an inner 
product (Problem 3.1(b)). Notice in particular that 


(gif) = (f lg)”. [3.8] 


Moreover, the inner product of f(x) with itself, 


b 
vus f IOP as. [39] 
a 
is real and non-negative; it's zero only? when f(x) = 0. 
A function is said to be normalized if its inner product with itself is 1; two 
functions are orthogonal if their inner product is 0; and a set of functions, {fa}, 
is orthonormal if they are normalized and mutually orthogonal: 


(finl fn) = Ôn- [3.10] 


Finally, a set of functions is complete if any other function (in Hilbert space) càn 
be expressed as a linear combination of them: 


oo 


foe» es @) [3.11] 


n=l 


tin Chapter 2 we were obliged on occasion to work with functions that were nor normalizable. 
Such functions lie outside Hilbert space, and we are going to have to handle them with special care, as 
you will see shortly. For the moment, 1 shall assume that all the functions we encounter are in Hilbert 
space, 

For a proof, see F. Riesz and B. Sz.-Nagy. Functional Analysis (Unger, New York, 1955), 
Section 21. In a finite dimensional vector space the Schwarz inequality, Io B)? < (aja) (BIB). is 
easy to prove (see Problem A.5). But that proof assumes the existence of the inner products, which is 
precisely what we are trying to establish here. 


SWhat about a function that is zero everywhere except at a few isolated points? The integral 
(Equation 3.9) would still vanish, even though the function itself does not. If this bothers you, you 
should have been a math major. In physics such pathological functions do not occur, but in any case, in 
Hilbert space two functions that have the same square integral are considered equivalent. Technically, 
vectors in Hilbert space represent equivalence classes of functions. 
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If the functions ( fj, (4)) are orthonormal, the coefficients are given by Fourier's trick: 


Cn = (flf). [3.12] 


as you can check for yourself. I anticipated this terminology, of course, back 
in Chapter 2. (The stationary states for the infinite square well (Equation 2.28) 
constitute a complete orthonormal set on the interval (0. a); the stationary states 
for the harmonic oscillator (Equation 2.67 or 2.85) are a complete orthonormal set 
on the interval (—oo. oo).) 


Problem 3.1 


(a) Show that the set of all square-integrable functions is a vector space (refer 
to Section A.1 for the definition). Hint: The main problem is to show that 
the sum of two square-integrable functions is itself square-integrable. Use 
Equation 3.7. Is the set of all normalized functions a vector space? 


(b) Show that the integral in Equation 3.6 satisfies the conditions for an inner 
product (Section A.2). 


xProblem 3.2 


(a) For what range of v is the function f(x) — x" in Hilbert space, on the 
interval (0. 1)? Assume v is real, but not necessarily positive. 

(b) For the specific case v — 1/2, is f(x) in this Hilbert space? What about 
xf (x)? How about (d/dx) f (x)? 


3.2 OBSERVABLES 


3.2.1 Hermitian Operators 


The expectation value of an observable Q(x. p) can be expressed very neatly in 
inner-product notation: 


(Q) = [ vod = (VOW). [3.13] 


7Remember that Ô is the operator constructed from Q by the replacement p > f = (h/i)d/dx. 
These operators are linear. in the sense that 


Olaf (x) T bg(x)] = aQ f (x) + bQg(x). 


for any functions f and g and any complex numbers a and b. They constitute linear transformations 
(Section A.3) on the space of all Tunctions. However. they sometimes carry a function inside Hilbert 
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Now, the outcome of a measurement has got to be real, and so, a fortiori, is the 
average of many measurements: 


(Q) = (Q)*. [3.14] 
But the complex conjugate of an inner product reverses the order (Equation 3.8), so 
(v|QV) = (QW|V). [3.15] 


and this must hold true for any wave function Y. Thus operators representing 
observables have the very special property that 


(1Qf) =(OF\F) for all f(x). [3.16] 


We call such operators hermitian. 
Actually, most books require àn ostensibly stronger condition: 


(flÓg) =(Oflg) forall f (x) and all g(x). [3.17] 


But it turns out, in spite of appearances, that this is perfectly equivalent to my 
definition (Equation 3.16), as you will prove in Problem 3.3. So use whichever 
you like. The essential point is that a hermitian operator can be applied either to 
the first member of an inner product or to the second, with the same result, and 
hermitian operators naturally arise in quantum mechanics because their expectation 
values are real: 


Observables are represented by hermitian operators. [3.18] 


Well, let's check this. Is the momentum operator, for example, hermitian? 
oo X /ħdfN* n 
«f 63 gdx=(Ďflg). [3-19] 


I used integration by parts, of course, and threw away the boundary term for the 
usual reason: If f(x) and g(x) are square integrable, they must go to zero at + oo.? 


a ©% hd h 
(iba - [ fre dz f*g 
-% idx i 


-%9 —oo 


space into a function outside it (see Problem 3.2(b)). and in this cuse the domain of the operator may 
have to be restricted. 

8 Actually. this is not quite true. As I mention in Chapter 1. there exist pathological functions 
that are square-integrable but do zot go to zero at infinity. However, such functions do not arise in 
physics. and if you are worried about it we will simply restrict the domain of our operators to exclude 
them. On finite intervals. though. you really do have to be more careful with the boundary terms, 
and an operator that is hermitian on (—oo, 00) may not be hermitian on (0. co) or (—x, 7). If you're 
wondering about the infinite square well. it’s safest to think of those wave functions as residing on the 
infinite line —they just happen to be zerv outside (0. a). 
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Notice how the complex conjugation of i compensates for the minus sign picked 
up from integration by parts—the operator d/dx (without the 7) is not hermitian, 
and it does not represent a possible observable. 


«Problem 3.3 Show that if (h| Qh) = (Qh|h) for all functions A (in Hilbert space), 
then (f|Og) = (Ô f lg) for all f and g (i.e., the two definitions of “hermi- 
tian” — Equations 3.16 and 3.17—are equivalent). Hint: First let h = f + g, and 
then let h = f +ig. 


Problem 3.4 
(a) Show that the sum of two hermitian operators is hermitian. 
(b) Suppose Ô is hermitian, and œ is a complex number. Under what condition 
(on a) is « Q hermitian? 
(c) When is the product of two hermitian operators hermitian? 


(d) Show that the position operator (€ — x) and the hamiltonian operator (H= 
—(h? /2m)d? /dx? + V(x)) are hermitian. 


Problem 3.5 The hermitian conjugate (or adjoint) of an operator Ô is the oper- 
ator Q* such that 


(flÓg) = (Q'flg) (for all f and g). [3.20] 
) .) 


(A hermitian operator, then, is equal to its hermitian conjugate: Q = Ô 


L2 
t 


(a) Find the hermitian conjugates of x, i, and d/dx. 


(b) Construct the hermitian conjugate of the harmonic oscillator raising operator, 
a (Equation 2.47). 


(c) Show that (QR)! = R' Or. 


3.2.2 Determinate States 


Ordinarily, when you measure an observable Q on an ensemble of identically 
prepared systems, all in the same state V, you do not get the same result each 
time—this is the indeterminacy of quantum mechanics.? Question: Would it be 
possible to prepare a state such that every measurement of Q is certain to return 
the same value (call it g)? This would be, if you like, a determinate state, for 
the observable Q. (Actually, we already know one example: Stationary states are 
determinate states of the Hamiltonian; a measurement of the total energy, on a 


?l'm talking about competent measurements, of course—it's always possible to make a mistake. 
and simply get the wrong answer. but that's not the fault of quantum mechanics. 
ply g q 
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particle in the stationary state V, is certain to yield the corresponding “allowed” 
energy E,.) 

Well. the standard deviation of Q, in a determinate state, would be zero, which 
is to say, 


2 = (Q—(9Q))-(wi(Q-aqw)-(Q-aqwiQ-aq)w)-0. [321] 


(Of course, if every measurement gives q, their average is also q: (Q) — q. I also 
used the fact that Ô, and hence also Ô — q, is a hermitian operator, to move one 
factor over to the first term in the inner product.) But the only function whose inner 
product with itself vanishes is O, so 


Qv =qW. [3.22] 


This is the eigenvalue equation for the operator Ô; V is an eigenfunction of Q. 
and q is the corresponding eigenvalue. Thus 


Determinate states are eigenfunctions of Q. 


Measurement of Q on such a state is certain to yield the eigenvalue, q. 

Note that the eigenvalue is a number (not an operator or a function). You can 
multiply any eigenfunction by a constant, and it is still an eigenfunction, with the 
same eigenvalue. Zero does not count as an eigenfunction (we exclude it by defi- 
nition— otherwise every number would be an eigenvalue, since Ô0 = q0 — 0 for 
any operator Ô and all q). But there’s nothing wrong with zero as an eigenvalue. 
The collection of all the eigenvalues of an operator is called its spectrum. Some- 
times two (or more) linearly independent eigenfunctions share the same eigenvalue; 
in that case the spectrum is said to be degenerate. 

For example, determinate states of the total energy are eigenfunctions of the 
Hamiltonian: 


[3.23] 


Hy = Ey. [3.24] 


which is precisely the time-independent Schródinger equation. In this context we 
use the letter E for the eigenvalue, and the lower case y for the eigenfunction (tack 
on the factor exp(—i Et/h) to make it Y, if you like; it's still an eigenfunction 
of H). 


Example 3.1 Consider the operator 
Q =i—, [3.25] 
where 6 is the usual polar coordinate in two dimensions. (This operator might arise 


in a physical context if we were studying the bead-on-a-ring; see Problem 2.46.) 
Is Q hermitian? Find its eigenfunctions and eigenvalues. 
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Solution: Here we are working with functions f'($) on the finite interval 0 < $ < 
27, and stipulate that 


f(@+ 27) = f(@). [3.26] 
since @ and @ + 2x describe the same physical point. Using integration by parts, 
» 27 dg 21 27 df* " 
vió = [ P (E) ein -f i( ~) edd = (Ô lg). 
Í 0 do E: 0 0 dó ic 


so Q is hermitian (this time the boundary term disappears by virtue of Equation 3.26). 
The eigenvalue equation, 


E re e 
pU ($) = qf (0). [3.27] 

has the general solution 
f(b) = Ae 49. [3.28] 


Equation 3.26 restricts the possible values of the q: 
eIn =] = g=0,+1.42,... [3.29] 


The spectrum of this operator is the set of all integers, and it is nondegenerate. 


Problem 3.6 Consider the operator Ô = d?/dg*, where (as in Example 3.1) 
@ is the azimuthal angle in polar coordinates, and the functions are subject to 
Equation 3.26. Is Q hermitian? Find its eigenfunctions and eigenvalues. What is 
the spectrum of Ô? Is the spectrum degenerate? 


3.3 EIGENFUNCTIONS OF A HERMITIAN OPERATOR 


Our attention is thus directed to the eigenfunctions of hermitian operators (phys- 
ically: determinate states of observables). These fall into two categories: If the 
spectrum is discrete (i.e., the eigenvalues are separated from one another) then the 
eigenfunctions lie in Hilbert space and they constitute physically realizable states. 
If the spectrum is continuous (i.e., the eigenvalues fill out an entire range) then 
the eigenfunctions are not normalizable, and they do not represent possible wave 
functions (though linear combinations of them—involving necessarily a spread 
in eigenvalues—may be normalizable). Some operators have a discrete spectrum 
only (for example, the Hamiltonian for the harmonic oscillator), some have only a 
continuous spectrum (for example. the free particle Hamiltonian), and some have 
both a discrete part and a continuous part (for example, the Hamiltonian for a 
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finite square well). The discrete case is easier to handle, because the relevant inner 
products are guaranteed to exist—in fact, it is very similar to the finite-dimensional 
theory (the eigenvectors of a hermitian matrix). I'll treat the discrete case first, and 
then the continuous one. 


3.3.1 Discrete Spectra 


Mathematically, the normalizable eigenfunctions of a hermitian operator have two 
important properties: 


Theorem 1: Their eigenvalues are real. 
Proof: Suppose 
Of — qf. 
(i.e., f(x) is an eigenfunction of Q, with eigenvalue q), and!? 
(19) = (Qf) 
(Q is hermitian). Then 
qF) =a" (Ff) 


(q is a number, so it comes outside the integral, and because the first function 
in the inner product is complex conjugated (Equation 3.6), so too is the q on 
the right). But (f| f) cannot be zero (f(x) = 0 is not a legal eigenfunction), 
so q = q*, and hence q is real. QED 


This is comforting: If you measure an observable on a particle in a determinate 
state, you will at least get a real number. 


Theorem 2:  Eigenfunctions belonging to distinct eigenvalues are ortho- 
gonal. 


Proof: Suppose 
Qf —qf. and Óg-q'g. 
and Q is hermitian. Then (f|Og) = (Of lg), SO 
q' (flg) =4* (fle) 


(again, the inner products exist because the eigenfunctions are in Hilbert 
space by assumption). But q is real (from Theorem 1), so if q' Æ q it must 
be that (fle) — 0. QED 


10t is here that we assume the eigenfunctions are in Hilbert space —otherwise the inner product 
might not exist at all. 


102 


Chapter 3 Formalism 


That’s why the stationary states of the infinite square well, for example, or the 
harmonic oscillator, are orthogonal —they are eigenfunctions of the Hamiltonian 
with distinct eigenvalues. But this property is not peculiar to them, or even to the 
Hamiltonian—the same holds for determinate states of any observable. 
Unfortunately, Theorem 2 tells us nothing about degenerate states (g^ = q). 
However, if two (or more) eigenfunctions share the same eigenvalue, any lin- 


ear combination of them is itself an eigenfunction, with the same eigenvalue 


(Problem 3.7(a)), and we can use the Gram-Schmidt orthogonalization proce- 
dure (Problem A.4) to construct orthogonal eigenfunctions within each degenerate 
subspace. It is almost never necessary to do this explicitly (thank God!), but it can 
always be done in principle. So even in the presence of degeneracy the eigenfunc- 
tions can be c/iosen to be orthogonal, and in setting up the formalism of quantum 
mechanics we shall assume that this has already been done. That licenses the use 
of Fourier's trick, which depends on the orthonormality of the basis functions. 

In a finite-dimensional vector space the eigenvectors of a hermitian matrix 
have a third fundamental property: They span the space (every vector can be 
expressed as a linear combination of them). Unfortunately, the proof does not 
generalize to infinite-dimensional spaces. But the property itself is essential to the 
internal consistency of quantum mechanics, so (following Dirac!!) we will take it 
as an axiom (or, more precisely, as a restriction on the class of hermitian operators 


that can represent observables): 


Axiom: The eigenfunctions of an observable operator are complete: Any 
function (in Hilbert space) can be expressed as a linear combination of 
them. !? 


Problem 3.7 


(a) Suppose that f(x) and g(x) are two eigenfunctions of an operator Q, with 
the same eigenvalue q. Show that any linear combination of .f and g is itself 
an eigenfunction of Q. with eigenvalue q. 


(b) Check that f(x) = exp(x) and g(x) = exp(—x) are eigenfunctions of the 
operator d?/dx?, with the same eigenvalue. Construct two linear combina- 
tions of f and g that are orthogonal eigenfunctions on the interval (—1. 1). 


Ip. A. M. Dirac. The Principles of Quantum Mechanics. Oxford University Press, New York 
(1958). 


4 mm . " . 

lIn some specific cases completeness is provable (we know that the stationary states of thc 
infinite square well. for example. are complete, because of Dirichlet’s theorem). lt is a little awk- 
ward to call something an "axiom" that is provable in some cases. but | don't know a better way to 
handle it. 
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Problem 3.8 


(a) Check that the eigenvalues of the hermitian operator in Example 3.1 are real. 
Show that the eigenfunctions (for distinct eigenvalues) are orthogonal. 


(b) Do the same for the operator in Problem 3.6. 


3.3.2 Continuous Spectra 


If the spectrum of a hermitian operator is continuous, the eigenfunctions are not 
normalizable, and the proofs of Theorems 1 and 2 fail, because the inner products 
may not exist. Nevertheless, there is a sense in which the three essential properties 
(reality, orthogonality, and completeness) still hold. I think it's best to approach 
this subtle case through specific examples. 


Example 3.2 Find the eigenfunctions and eigenvalues of the momentum operator. 
Solution: Let fp(x) be the eigenfunction and p the eigenvalue: 

h d 

—— fp(X) = pfp(x). 3.30 

; qx dP ) Df ) [ ] 
The general solution is 

fp (x) = Ae?!” 

This is not square-integrable, for any (complex) value of p—the momentum oper- 
ator has no eigenfunctions in Hilbert space. And yet, if we restrict ourselves to 


real eigenvalues, we do recover a kind of ersatz “orthonormality.” Referring to 
Problems 2.24(a) and 2.26, 


co OO. A : 
J fo) fp) dx = ap f ei(P-PW/^ dx = |AP 2h (p — p). [3.31] 
-00 -XQ 


If we pick A = 1/4/27 i, so that 


\ ipx /li 
fpQ) Jat , [3.32] 
then 
(fp'l fp) = 8(p — p^. [3.33] 


which is strikingly reminiscent of true orthonormality (Equation 3.10)—the indices 
are now continuous variables, and the Kronecker delta has become a Dirac delta, 
but otherwise it looks just the same. Ill call Equation 3.33 Dirac orthonormality. 
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Most important, the eigenfunctions are complete, with the sum (in Equation 3.11) 
replaced by an integral: Any (square-integrable) function f(x) can be written in 
the form 


x)= c(p) fox) dp = —— c(p)el*/^ dp. 3.34 
fom f e fdp = —— | edp B34 
The expansion coefficient (now a function, c(p)) is obtained, as always, by Fourier’ s 
trick: 

oo 


ec 
(fpl) = f c(p) fpl fp) dp = / c(p)&(p — p)dp-—c(p). — 13.35] 

—co -œ 
Alternatively, you can get them from Plancherel’s theorem (Equation 2.102), for 
the expansion (Equation 3.34) is nothing but a Fourier transform. 


The eigenfunctions of momentum (Equation 3.32) are sinusoidal, with wave- 

length 
2xh 

ara 
This is the old de Broglie formula (Equation 1.39), which I promised to prove at 
the appropriate time. It turns out to be a little more subtle than de Broglie imagined, 
because we now know that there is actually no such thing as a particle with deter- 
minate momentum. But we could make a normalizable wave packet with a narrow 
range of momenta, and it is to such an object that the de Broglie relation applies. 

What are we to make of Example 3.2? Although none of the eigenfunctions 
of p lives in Hilbert space, a certain family of them (those with real eigenvalues) 
reside in the nearby “suburbs,” with a kind of quasi-normalizability. They do not 
represent possible physical states, but they are still very useful (as we have already 
seen, in our study of one-dimensional scattering).'? 


À [3.36] 


Example 3.3 Find the eigenfunctions and eigenvalues of the position operator. 


Solution: Let g,(x) be the eigenfunction and y the eigenvalue: 


X gy) = y gy (x). [3.37] 


I5What about the eigenfunctions with nonreal eigenvalues? These are not merely non- 
normalizable— they actually blow up at c oo. Functions in what | called the "suburbs" of Hilbert space 
(the entire metropolitan area is sometimes called a "rigged Hilbert space": see. for example, Leslie 
Ballentine’s Quantum Mechanics: A Modern Development, World Scientific, 1998) have the property 
that although they have no (finite) inner product with themselves, thcy do admit inner products with all 
members of Hilbert space. This is not true for eigenfunctions of f with nonreal eigenvalues. In particu- 
lar. I showed that the momentum operator is hermitian for functions in Hilbert space. but the argument 
depended on dropping the boundary term (in Equation 3.19). That term is still zero if g is an eigenfunc- 
tion of p with a real eigenvalue (as long as f is in Hilbert space), but not if the eigenvalue has an imag- 
inary part. In this sense any complex number is an eigenvalue of the operator p, but only real numbers 
are eigenvalues of the /termitian operator P—the others lic outside the space over which p is hermitian. 
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Here y is a fixed number (for any given eigenfunction), but x is a continuous 
variable. What function of x has the property that multiplying it by x is the same 
as multiplying it by the constant y? Obviously it's got to be zero, except at the one 
point x = y; in fact, it is nothing but the Dirac delta function: 


gy (x) = Ad(x — y). 


This time the eigenvalue has to be real; the eigenfunctions are not square-integrable, 
but again they admit Dirac orthonormality: 


oo oo 
J 8») By (x) dx = JA? J B(x — y')8(x — y) dx = lAPSQ — y’). [3-38] 
—oQ i -X 
If we pick A = 1, so 

g&y(x) = 9(x — y). [3.39] 


then 
(8y1gy) = (y — v). [3.40] 


These eigenfunctions are also complete: 


oo oo 
f(x)- J c(y) g; Cody = Í c(y)ó(x — v) dy, [3.41] 
-—O00 =O 
with 
c(y) = f(y) [3.42] 


(trivial, in this case, but you can get it from Fourier’s trick if you insist). 


If the spectrum of a hermitian operator is continuous (so the eigenvalues are 
labeled by a continuous variable— p or y. in the examples; z, generically, in what 
follows), the eigenfunctions are not normalizable, they are not in Hilbert space and 
they do not represent possible physical states; nevertheless, the eigenfunctions with 
real eigenvalues are Dirac orthonormalizable and complete (with the sum now an 
integral). Luckily, this is all we really require. 


Problem 3.9 


(a) Cite a Hamiltonian from Chapter 2 (other than the harmonic oscillator) that 
has only a discrete spectrum. 


(b) Cite a Hamiltonian from Chapter 2 (other than the free particle) that has only 
a continuous spectrum. 


106 Chapter 3 Formalism 


(c) Cite a Hamiltonian from Chapter 2 (other than the finite square well) that 
has both a discrete and a continuous part to its spectrum. 


Problem 3.10 Is the ground state of the infinite square well an eigenfunction of 
momentum? If so, what is its momentum? If not, why not? 


3.4 GENERALIZED STATISTICAL INTERPRETATION 


In Chapter 1 I showed you how to calculate the probability that a particle would be 
found in a particular location, and how to determine the expectation value of any 
observable quantity. In Chapter 2 you learned how to find the possible outcomes 
of an energy measurement and their probabilities. I am now in a position to state 
the generalized statistical interpretation, which subsumes all of this and enables 
you to figure out the possible results of any measurement, and their probabilities. 
Together with the Schródinger equation (which tells you how the wave function 
evolves in time) it is the foundation of quantum mechanics. 

Generalized statistical interpretation: If you measure an observable Q(x, p) 
on a particle in the state Y (x,t), you are certain to get one of the eigenvalues of 
the hermitian operator Q(x. —ihd /d x). If the spectrum of Ô is discrete, the prob- 
ability of getting the particular eigenvalue qn» associated with the orthonormalized 
eigenfunction f,(x) is 


le. where c, = (fal). [3.43] 


If the spectrum is continuous, with real eigenvalues q(z) and associated Dirac- 
orthonormalized eigenfunctions f.(x), the probability of getting a result in the 
range dz is 

lc(z) dz where c(z) = (f.|V). [3.44] 


Upon measurement, the wave function "collapses" to the corresponding eigen- 

14 
state. 

The statistical interpretation is radically different from anything we encounter 
in Classical physics. A somewhat different perspective helps to make it plausible: 
The eigenfunctions of an observable operator are complete, so the wave function 
can be written as a linear combination of them: 


Wx.) = 9 nfa). [3.45] 


lIn the case of continuous spectra the collapse is to a narrow range about the measured value, 
depending on the precision of the measuring device. 
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(For simplicity, I'll assume that the spectrum is discrete; it's easy to generalize this 
argument to the continuous case.) Because the eigenfunctions are orthonormal, the 
coefficients are given by Fourier's trick:'? 


= tw J ROVE, D dx. [3.46] 


Qualitatively, c, tells you “how much f» is contained in V," and given that a 
measurement has to return one of the eigenvalues of Q, it seems reasonable that 
the probability of getting the particular eigenvalue q, would be determined by the 
"amount of f," in V. But because probabilities are determined by the absolute 
square of the wave function, the precise measure is actually |c4|*. That's the 
essential burden of the generalized statistical interpretation.!Ó 
Of course, the total probability (summed over all possible outcomes) has got 
to be one: 
> lorek [3.47] 


HW 


and sure enough, this follows from the normalization of the wave function: 


L 


2 
= bs 2. Cj Cn Onn = yea = >) len l7. [3.48] 
n^ "Hn n 


n 


i = (Y|Y) = 5- cata | doen fu = epe Ul fn) 
n’ n n n 


Similarly, the expectation value of Q should be the sum over all possible outcomes 
of the eigenvalue times the probability of getting that eigenvalue: 


(Q) = alal [3.49] 
Indeed, 
(Q) = (WOW) =| [9 ewfe Jl O9 fs ] ). [3.50] 


I5Notice that the time dependence — which is not at issue here—is carried by the coefficients; 
to make this explicit. we should really write cj; (1). 

l6 Again, I am scrupulously avoiding the all-too-common assertion "Ic, |? is the probability that 
the particle is in the state fp.” This is nonsense. The particle is in the state Y, period. Rather, len|? is 
the probability that a measurement of Q would yield the value qy. It is true that such a measurement 
will collapse the state to the eigenfunction fj, so one could correctly say “|¢, |? is the probability that 
a particle which is now in the state V will be in the state fy subsequent to a measurement of Q”... 
but that's a completely different assertion. 
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but Q f, = Qn fn; so 


(Q) = bD > Cr CnQu fu’ fn) = >» SD cS ales. [3.51] 
n^ n n n" 


n 


So far, at least, everything looks consistent. 

Can we reproduce, in this language, the original statistical interpretation for 
position measurements? Sure—it’s real overkill, but worth checking. A measure- 
ment of x on a particle in state V must return one of the eigenvalues of the 
position operator. Well, in Example 3.3 we found that every (real) number y is an 
eigenvalue of x, and the corresponding (Dirac-orthonormalized) eigenfunction is 
gy (x) = 9(x — y). Evidently 


oo 
c(vy) = (g,]V) = | d(x — y)W(x.t) dx = W(y.t), [3.52] 


so the probability of getting a result in the range dy is |W(y. r)|? dy, which is 
precisely the original statistical interpretation. 

What about momentum? In Example 3.2 we found that the eigenfunctions of 
the momentum operator are f£, (x) = (1/4/2z f) exp(ipx/h), so 


] ME n 
c(p) = (fol¥) = x | e PU gx. 1) dx. [3.53] 


This is such an important quantity that we give it a special name and symbol: the 
momentum space wave function, (p. t). It is essentially the Fourier transform 
of the (position space) wave function V (x, t) — which, by Plancherel's theorem, 
is its inverse Fourier transform: 


1 "oO 
(p, i) = —— e PP Nye 0 dx: 3.54 
l co 
Wx, t) = e 5. t) dp. 3.55 
(x, f) JE a (p.t)dp [3.55] 


According to the generalized statistical interpretation, the probability that a mea- 
surement of momentum would yield a result in the range dp is 


Ie (p, D? dp. [3.56] 


Example 3.4 A particle of mass m is bound in the delta function well V(x) = 
—aó(x). What is the probability that a measurement of its momentum would yield 
a value greater than po = ma/h? 
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Solution: The (position space) wave function is (Equation 2.129) 


Vr) = NMA ine|s|/h? ,—i Et fh 
i h 
(where E = —ma*/2h*). The momentum space wave function is therefore 
3/2 —iEt/R 
O(p,t) = | ma eg iEUh L. g ipx/h o mo|x|/h* "ba ES oe 
Ja2ih h —oo X p? + Do 
(I looked up the integral). The probability, then, is 
oo 
2 i 1 1 
=P | ae o ce D z un^ 6 
5m po (p + po)* T | pe + 0 PO pi 


(again, I looked up the integral). 


Problem 3.11 Find the momentum-space wave function, c (p, t), for a particle in 
the ground state of the harmonic oscillator. What is the probability (to 2 significant 
digits) that a measurement of p on a particle in this state would yield a value 
outside the classical range (for the same energy)? Hint: Look in a math table 
under “Normal Distribution" or “Error Function" for the numerical part—or use 
Mathematica. 


Problem 3.12 Show that 


(x) = fo (5) o dp. [3.57] 


Hint: Notice that x exp(ipx/h) = —ih(d/dp) exp(ipx/h). 
In momentum space, then, the position operator is iHd/dp. More generally, 


^ h 0 
J w*O (x =) Wdx, in position space; 
i ! 
(Q(x. p)) = ha [3.58] 
J $*Q (- 3p" p) dp, in momentum space. 
i Op 


In principle you can do all calculations in momentum space just as well (though 
not always as easily) as in position space. 
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3.5 THE UNCERTAINTY PRINCIPLE 


I stated the uncertainty principle (in the form oxo, > h/2), back in Section 1.6, 
and you have checked it several times, in the problems. But we have never actually 
proved it. In this section I will prove a more general version of the uncertainty 
principle, and explore some of its implications. The argument is beautiful, but 
rather abstract, so watch closely. 


3.5.1 Proof of the Generalized Uncertainty Principle 


For any observable A, we have (Equation 3.21): 
of = (Â — (A WI — (A) = (FIP), 
where f = (A — (A))W. Likewise, for any other observable, B, 
og = (glg), where g = (B — (B))V. 
Therefore (invoking the Schwarz inequality, Equation 3.7), 


oxoR = (FAf)(gle) = Wf le). [3.59] 


Now, for any complex number z, 
3 l : 
Iz = [Re(z)}? + lImG) > Imc) = EG - | ; [3.60] 


Therefore, letting z = (f |g), 


5 2 1 2 
oho} > (Fle) - ei) - [3.61] 


But 


V|(AB — A(B) — B(A) + (A)(B))¥) 


Similarly, 


(g1/) = (BA) — (A)(B), 
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SO 


(Fig) — (glf) = (AB) — (BA) = ([À, B], 


where 


[3.62] 


This is the (generalized) uncertainty principle. You might think the i makes it 
trivial—isn’t the right side negative? No, for the commutator of two hermitian 
operators carries its own factor of i, and the two cancel out.!? 

As an example, suppose the first observable is position (A = x), and the 
second is momentum (B = (h/i)d/dx). We worked out their commutator back in 
Chapter 2 (Equation 2.51): 


So 


or, since standard deviations are by their nature positive, 


; 
asop > >. [3.63] 


That's the original Heisenberg uncertainty principle, but we now see that it is just 
one application of a much more general theorem. 

There is, in fact, an "uncertainty principle" for every pair of observables whose 
operators do not commute —we call them incompatible observables. Incompatible 
observables do not have shared eigenfunctions—at least, they cannot have a com- 
plete set of common eigenfunctions (see Problem 3.15). By contrast, compatible 
(commuting) observables do admit complete sets of simultaneous eigenfunctions.!? 


l7 More precisely, the commutator of two hermitian operators is itself arti -hermitian (Q* = -Ó. 
and its expectation value is imaginary (Problem 3.26). 


I8This corresponds to the fact that noncommuting matrices cannot be simultaneously diagonalized 
(that is, they cannot both be brought to diagonal form by the same similarity transformation), whereas 
commuting hermitian matrices can be simultaneously diagonalized. See Section A.5. 
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For example, in the hydrogen atom (as we shall see in Chapter 4) the Hamilto- 
nian, the magnitude of the angular momentum, and the z component of angular 
momentum are mutually compatible observables, and we will construct simultane- 
ous eigenfunctions of all three, labeled by their respective eigenvalues. But there is 
no eigenfunction of position that is also an eigenfunction of momentum, because 
these operators are incompatible. 

Note that the uncertainty principle is not an extra assumption in quantum 
theory, but rather a consequence of the statistical interpretation. You might wonder 
how it is enforced in the laboratory—why can't you determine (say) both the 
position and the momentum of a particle? You can certainly measure the position 
of the particle, but the act of measurement collapses the wave function to a narrow 
spike, which necessarily carries a broad range of wavelengths (hence momenta) 
in its Fourier decomposition. If you now measure the momentum, the state will 
collapse to a long sinusoidal wave, with (now) a well-defined wavelength— but 
the particle no longer has the position you got in the first measurement.!? The 
problem, then, is that the second measurement renders the outcome of the first 
measurement obsolete. Only if the wave function were simultaneously an eigenstate 
of both observables would it be possible to make the second measurement without 
disturbing the state of the particle (the second collapse wouldn't change anything, 
in that case). But this is only possible, in general, if the two observables are 
compatible. 


xProblem 3.13 


(a) Prove the following commutator identity: 
[AB. C] = A[B. C] - [A. C]B. [3.64] 


(b) Show that 


[x". p] = ihnx" |. 


(c) Show more generally that 


—, [3.65] 
dx 


d 
ORE e 


for any function f(x). 


I?Niels Bohr was at pains to track down the mechanism by which the measurement of x (for 
instance) destroys the previously existing value of p. The crux of the matter is that in order to determine 
the position of a particle you have to poke it with something —shine light on it. say. But these photons 
impart to the particle a momentum you cannot control. You now know the position, but you no longer 
know the momentum. His famous debates with Einstein include many delightful examples. showing 
in detail how experimental constraints enforce the uncertainty principle. For an inspired account see 
Bohr's article in Albert Einstein: Philosopher-Scientist. edited by P. A. Schilpp, Tudor, New York 
(1949). 
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*Problem 3.14 Prove the famous “(your name) uncertainty principle,” relating the 
uncertainty in position (A = x) to the uncertainty in energy (B = p*/2m + Vy: 


hi 
0x0H > —|(p)|. 
2m 


For stationary states this doesn't tell you much— why not? 


Problem 3.15 Show that two noncommuting operators cannot have a complete set 
of common eigenfunctions. Hint: Show that if P and Q have a complete set of 
common eigenfunctions, then [P. Q]f = 0 for any function in Hilbert space. 


3.5.2 The Minimum-Uncertainty Wave Packet 


We have twice encountered wave functions that hit the position-momentum 
uncertainty limit (0,0, = f/2): the ground state of the harmonic oscillator 
(Problem 2.11) and the Gaussian wave packet for the free particle (Problem 2.22). 
This raises an interesting question: What is the most general minimum-uncertainty 
wave packet? Looking back at the proof of the uncertainty principle, we note that 
there were two points at which inequalities came into the argument: Equation 3.59 
and Equation 3.60. Suppose we require that each of these be an equality, and see 
what this tells us about V. 

The Schwarz inequality becomes an equality when one function is a multi- 
ple of the other: g(x) = cf (x), for some complex number c (see Problem A.5). 
Meanwhile, in Equation 3.60 I threw away the real part of z; equality results if 
Re(z) = 0, which is to say, if Re(f |g) = Re(c(f|f)) = 0. Now, (f |f) is certainly 
real, so this means the constant c must be purely imaginary — let's call it ia. The 
necessary and sufficient condition for minimum uncertainty, then, is 


g(x) =iaf(x), where a is real. [3.66] 
For the position-momentum uncertainty principle this criterion becomes: 
hd 
Gz 5 2 V = ia(x — (x))V. [3.67] 
i dx 


which is a differential equation for V as a function of x. Its general solution 
(Problem 3.16) is 
W(x) = Ae ae (8) Ph gi (pyx [hi [3.68] 


Evidently the minimum-uncertainty wave packet is a gaussian —and the two exam- 
. : . 2 
ples we encountered earlier were gaussians.^ 


20Note that it is only the dependence of Y on x that is at issue here—the “constants” A. a. (x). 
and (p) may all be functions of time. and for that matter Y may evolve away from the minimal form. 
All I'm asserting is that if, at some instant. the wave function is gaussian in x. then (at that instant) the 
uncertainty product is minimal. 
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Problem 3.16 Solve Equation 3.67 for ¥(x). Note that (x) and (p) are constants. 


3.5.3 The Energy-Time Uncertainty Principle 


The position-momentum uncertainty principle is often written in the form 
h 
Ax Ap = 5: [3.69] 


Ax (the “uncertainty” in x) is loose notation (and sloppy language) for the standard 


deviation of the results of repeated measurements on identically prepared systems.?! 


Equation 3.69 is often paired with the energy-time uncertainty principle, 


; 
At AE > 7: [3.70] 


Indeed, in the context of special relativity the energy-time form might be thought 
of as a consequence of the position-momentum version, because x and t (or 
rather, cf) go together in the position-time four-vector, while p and E (or rather, 
E /c) go together in the energy-momentum four-vector. So in a relativistic theory 
Equation 3.70 would be a necessary concomitant to Equation 3.69. But we're not 
doing relativistic quantum mechanics. The Schródinger equation is explicitly non- 
relativistic: It treats t and x on a very unequal footing (as a differential equation 
it is first-order in t, but second-order in x), and Equation 3.70 is emphatically not 
implied by Equation 3.69. My purpose now is to derive the energy-time uncertainty 
principle, and in the course of that derivation to persuade you that it is really an 
altogether different beast, whose superficial resemblance to the position-momentum 
uncertainty principle is actually quite misleading. 

After all, position, momentum, and energy are all dynamical variables— 
measurable characteristics of the system, at any given time. But time itself is 
not a dynamical variable (not, at any rate, in a nonrelativistic theory): You don't 
go out and measure the "time" of a particle, as you might its position or its energy. 
Time is the independent variable, of which the dynamical quantities are func- 
tions. In particular, the Aż in the energy-time uncertainty principle is not the 
standard deviation of a collection of time measurements; roughly speaking (I'll 
make this more precise in a moment) it is the time it takes the system to change 
substantially. 


?! Many casual applications of the uncertainty principle are áctually based (often inadvertently) on 
a completely different —and sometimes quite unjustified — measure of "uncertainty." Conversely. some 
perfectly rigorous arguments use other definitions of "uncertainty." See Jan Hilgevoord. Arm. J. Phys. 
70, 983 (2002). 
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As a measure of how fast the system is changing, let us compute the time 
derivative of the expectation value of some observable, Q(x. p. t): 


d a0 RET 
E )- lez). 


d ^ OW] a 
7 (2) = (wl Qv) -(>| ou) « (s 


Now, the Schródinger equation says 


(where H = p?/2m + V is the Hamiltonian). So 


E oes L A L 
412 T a a + zp VIQHY) +( Ór ) 


But # is hermitian, so (HU|OW) = (VIR Qw), and hence 


[3.71] 


This is an interesting and useful result in its own right (see Problems 3.17 and 
3.31). In the typical case where the operator does not depend explicitly on time,?? 
it tells us that the rate of change of the expectation value is determined by the 
commutator of the operator with the Hamiltonian. In particular, if Q commutes 
with H, then (Q) is constant, and in this sense Q is a conserved quantity. 

Now, suppose we pick A = H and B = Q, in the generalized uncertainty 
principle (Equation 3.62), and assume that Q does not depend explicitly on f: 


- lox ^. Odd aN ay 
ohog = (508.20) - (5775) -(5) (Gar): 


Or, more simply, 


h |d(Q) 
> — i 3.72 
andoz | [3.72] 
Let's define AE = og, and 
TQ 
At = ———=—_.. [3.73] 
id (Q) /dt| 


22 Operators that depend explicitly on f are quite rare. so almost always aQ /dt = 0. As an 
example of explicit time dependence, consider the potential energy of a harmonic oscillator whose 
spring constant is changing (perhaps the temperature is rising. so the spring becomes more flexible): 


Q = (1/2)m[w(t) Px. 
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Then h 
AE At > 5 [3.74] 


and that’s the energy-time uncertainty principle. But notice what is meant by Af, 
here: Since 


= Ae 
?Q 7 at 


At represents the amount of time it takes the expectation value of Q to change by 
one standard deviation.?? In particular, At depends entirely on what observable 
(Q) you care to look at—the change might be rapid for one observable and slow 
for another. But if AE is small, then the rate of change of all observables must be 
very gradual; or, to put it the other way around, if any observable changes rapidly, 
the "uncertainty" in the energy must be large. 


At, 


Example 3.5 In the extreme case of a stationary state, for which the energy is 
uniquely determined, all expectation values are constant in time (AE = 0 => 
At = o)—as in fact we noticed some time ago (see Equation 2.9). To make 
something happen you must take a linear combination of at least two stationary 
states—say: 


W(x, t) = avi Qe 1 P^ + baa (x)e i ELA, 
If a, b, Yi, and y» are real, 
2 2 2 2 2 Ej;—E| 
[i (x, 0J^ = a“ (Wy (x))° + b^ (Q2 Q0) + 2aby (x) (x) cos MEC à 
The period of oscillation is tT = 27h /(E2— Ej). Roughly speaking, AE = E? — Ej 
and Ar = Tt (for the exact calculation see Problem 3.18), so 


AE At = 27h, 
which is indeed >fi/2. 


Example 3.6 How long does it take a free-particle wave packet to pass by a partic- 
ular point (Figure 3.1)? Qualitatively (an exact version is explored in Problem 3.19), 
At = Ax/v = mAx/p, but E = p*/2m, so AE = pAp/m. Therefore, 


pAp mAx 


AE At = = AxAp, 


m 


which is >f/2 by the position-momentum uncertainty principle. 


?*This is sometimes called the "Mandelstam-Tamm" formulation of the energy-time uncertainty 
principle. For a review of alternative approaches see Paul Busch, Found. Phys. 20, 1 (1990). 
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e————— Ax —————»| 


A x 


FIGURE 3.1: A free particle wave packet approaches the point A (Example 3.6). 


Example 3.7 The A particle lasts about 1072? seconds, before spontaneously 
disintegrating. If you make a histogram of all measurements of its mass, you get 
a kind of bell-shaped curve centered at 1232 MeV/c’, with a width of about 120 
MeV/c? (Figure 3.2). Why does the rest energy (mc?) sometimes come out higher 
than 1232, and sometimes lower? Is this experimental error? No, for 


1 
AE At = (> Mev) (1073 sec) = 6 x 107? MeV sec, 


whereas fi/2 = 3 x 1077? MeV sec. So the spread in m is about as small as the 
uncertainty principle allows—a particle with so short a lifetime just doesn't have 
a very well-defined mass. 


1100 1200 1300 1400 
MASS (MeV/c?) 


FIGURE 3.2: Histogram of measurements of the A mass (Example 3.7). 


H Actually, Example 3.7 is a bit of a fraud. You can't measure 1077? sec on a stop-watch, and in 
practice the lifetime of such a short-lived particle is inferred from the width of the mass plot, using the 
uncertainty principle as input. However. the point is valid, even if the logic is backwards. Moreover, 
if you assume the A is about the same size as à proton (107 !? m), then 107? sec is roughly the 
time it takes light to cross the particle. and it’s hard to imagine that the lifetime could be much less 
than that. 
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Notice the variety of specific meanings attaching to the term Ar in these 
examples: In Example 3.5 it’s a period of oscillation: in Example 3.6 it’s the time 
it takes a particle to pass a point; in Example 3.7 it’s the lifetime of an unstable 
particle. In every case, however, Ar is the time it takes for the system to undergo 
“substantial” change. 

It is often said that the uncertainty principle means energy is not strictly 
conserved in quantum mechanics—that you’re allowed to “borrow” energy AE, 
as long as you “pay it back" in a time At 7 i/(2AE); the greater the violation, 
the briefer the period over which it can occur. Now, there are many legitimate 
readings of the energy-time uncertainty principle, but this is not one of them. 
Nowhere does quantum mechanics license violation of energy conservation, and 
certainly no such authorization entered into the derivation of Equation 3.74. But the 
uncertainty principle is extraordinarily robust: It can be misused without leading 
to seriously incorrect results, and as a consequence physicists are in the habit of 
applying it rather carelessly. 


xProblem 3.17 Apply Equation 3.71 to the following special cases: (a) Q = 1; 
(b) Q = H; (c) Q = x; (d) Q = p. In each case, comment on the result, 
with particular reference to Equations 1.27, 1.33, 1.38, and conservation of energy 
(comments following Equation 2.39). 


Problem 3.18 Test the energy-time uncertainty principle for the wave function in 
Problem 2.5 and the observable x, by calculating oy, Cy, and d(x)/dt exactly. 


Problem 3.19 Test the energy-time uncertainty principle for the free particle wave 
packet in Problem 2.43 and the observable x, by calculating oy, oy, and d(x)/df 
exactly. 


Problem 3.20 Show that the energy-time uncertainty principle reduces to the “your 
name" uncertainty principle (Problem 3.14), when the observable in question is x. 


3.6 DIRAC NOTATION 


Imagine an ordinary vector À in two dimensions (Figure 3.3(a)). How would you 
describe this vector to someone? The most convenient way is to set up carte- 
sian axes, x and y, and specify the components of A: Ay — 1: A. A, = J-A 
(Figure 3.3(b)). Of course, your sister might have drawn a different set of axes, 
x’ and y’, and she would report different components: A‘. = ?’- A. A, = J-A 
(Figure 3.3(c)). But it's all the same vector —we're simply expressing it with 
respect to two different bases ({7. J} and {7’, 7’}). The vector itself lives "out there 
in space," independent of anybody's (arbitrary) choice of coordinates. 
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(a) (b) (c) 


FIGURE 3.3: (a) Vector A. (b) Components of A with respect to xy axes. 
(c) Components of A with respect to x’y’ axes. 


The same is true for the state of a system in quantum mechanics. It is rep- 
resented by a vector, |4(t)), that lives “out there in Hilbert space,” but we can 
express it with respect to any number of different bases. The wave function W(x. f) 
is actually the coefficient in the expansion of |4) in the basis of position eigen- 
functions: 

W(x, t) = (x|4(1)). [3.75] 


(with |x) standing for the eigenfunction of 3 with eigenvalue x),2> whereas the 
momentum space wavefunction ®(p.r) is the expansion of |4) in the basis of 
momentum eigenfunctions: 


(p.t) = (p|&(r) [3.76] 


(with |) standing for the eigenfunction of p with eigenvalue p).?$ Or we could 
expand |4) in the basis of energy eigenfunctions (supposing for simplicity that the 
spectrum is discrete): 

Cn (t) = (n|&(r)) [3.77] 


(with |n) standing for the nth eigenfunction of H )— Equation 3.46. But it’s all the 
same state; the functions V and 4, and the collection of coefficients {c,,}, contain 
exactly the same information—they are simply three different ways of describing 
the same vector: 


| 
J/ 2x h 


cx que ied y [3.78] 


V(x.1)— J V(y.D8(x — y)dy = J (p.t) eP¥/ dp 


JST don't want to call it gy (Equation 3.39). because that is its form in the position basis, and 
the whole point here is to free ourselves from any particular basis. Indeed. when I first defined Hilbert 
space as the set of squarc-integrable functions —over x—that was already too restrictive. committing 
us to a specific representation (the position basis). | want now to think of it as an abstract vector space. 
whose members can be expressed with respect to any basis you like. 


*61n position space it would be Jp (x) (Equation 3.32). 
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Operators (representing observables) are linear transformations—they “trans- 
form” one vector into another: 


I8) = Ola). [3.79] 


Just as vectors are represented, with respect to a particular basis {|e,)},7’ by their 
components, 


la) = X anlen). with a, = (enlæ): |B) = So balen). with b, = (en|B). [3.80] 
n n 


operators are represented (with respect to a particular basis) by their matrix ele- 
28 
ments 


(ewlÓlen) = Qm. [3.81] 
In this notation Equation 3.79 takes the form 
3 bu len) = S| an Olen). [3.82] 
n n 
or, taking the inner product with |en), 
S bn (emlen) = S| an (em| Olen). [3.83] 
n Hi 
and hence 
bm = 27 Qnin. [3.84] 
n 


Thus the matrix elements tell you how the components transform. 

Later on we will encounter systems that admit only a finite number (N) of 
linearly independent states. In that case |4(r)) lives in an N-dimensional vector 
space; it can be represented as a column of (N) components (with respect to a 
given basis), and operators take the form of ordinary (N x N) matrices. These 
are the simplest quantum systems—none of the subtleties associated with infinite- 
dimensional vector spaces arise. Easiest of all is the two-state system, which we 
explore in the following example. 


Example 3.8 Imagine a system in which there are just two linearly independent 


states;?? 
l 0 
|1) = (o) and |2) = (7) : 


FITI assume the basis is discrete: otherwise n becomes a continuous index and the sums are 
replaced by integrals. 

?SThis terminology is inspired. obviously. by the finite-dimensional case, but the "matrix" will 
now typically have an infinite (maybe even uncountable) number of elements. 

Technically. the "equals" signs here mean “is represented by." but I don't think any confusion 
will arise if we adopt the customary informal notation. 
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The most general state is a normalized linear combination: 


a 


|4) = a|1) + b12) = [ 


J with |a|* + |b|? = 1. 


The Hamiltonian can be expressed as a (hermitian) matrix; suppose it has the 


specific form 
Bras (" a l 
g h 


where g and h are real constants. If the system starts out (at t = 0) in state |1), 
what is its state at time t? 


Solution: The (time-dependent) Schrödinger equation says 
TEMPS = H4) [3.85] 
nj My : 
As always, we begin by solving the time-independent Schródinger equation: 
H|«) = E|«); [3.86] 
that is, we look for the eigenvectors and eigenvalues of H. The characteristic 
equation determines the eigenvalues: 


s (^ LP pig) = @- EPP a0 h- Em am Eg hte, 


Evidently the allowed energies are (h + g) and (h — g). To determine the eigen- 
vectors, we write 


d s PEEDIMELDLCEDET EET 


so the normalized eigenvectors are 


d) 


Next we expand the initial state as a linear combination of eigenvectors of the 
Hamiltonian: 


l4) = (6) = 4 (s+) +14). 
Finally, we tack on the standard time-dependence exp(—iE,,t/h): 
1 
a 


[e 8h gy sb east, y] 


|S(t)) = 
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l inn | igih (1 igh (. l 
= 5° e 1 +e E 
1 can (e 9 EIN -imya ( cos(gt/R) 
2 e i8 /h XE git /^ —i sin(gt/h) j 
If you doubt this result, by all means check it: Does it satisfy the time-dependent 
Schrödinger equation? Does it match the initial state when t = 0? 
This is a crude model for (among other things) neutrino oscillations. In 
that case |1) represents the electron neutrino, and |2) the muon neutrino; if the 


Hamiltonian has a nonvanishing off-diagonal term (g) then in the course of time 
the electron neutrino will turn into a muon neutrino (and back again). 


Dirac proposed to chop the bracket notation for the inner product, (@|f), into 
two pieces, which he called bra, (œ|, and ket, |P) (I don’t know what happened to 
the c). The latter is a vector, but what exactly is the former? It's a linear function 
of vectors, in the sense that when it hits a vector (to its right) it yields a (complex) 
number—the inner product. (When an operator hits a vector, it delivers another 
vector; when a bra hits a vector, it delivers a number.) In a function space, the bra 
can be thought of as an instruction to integrate: 


uto f reas 
with the ellipsis [--- ] waiting to be filled by whatever function the bra encounters in 


the ket to its right. In a finite-dimensional vector space, with the vectors expressed 
as columns, 


ay 
a? 
œ=. I, [3.87] 
ay 
the corresponding bra is a row vector: 
ta e (atat. oot y [3.88] 


The collection of all bras constitutes another vector space—the so-called dual 
space. 

The license to treat bras as separate entities in their own right allows for 
some powerful and pretty notation (though I shall not exploit it in this book). For 
example, if |o) is a normalized vector, the operator 


P = ja) lal [3.89] 
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picks out the portion of any other vector that “lies along" |o): 


P|B) = (a|B)|a): 


we call it the projection operator onto the one-dimensional subspace spanned by 
la). If {]e,)} is a discrete orthonormal basis, 


(emlen) = Omn- [3.90] 
then 
BIDICIES [3.91] 
n 


(the identity operator). For if we let this operator act on any vector |a), we recover 
the expansion of |æ) in the {|e,,)} basis: 


Š len) (en lee) = lo). [3.92] 
n 
Similarly, if (|e;)) is a Dirac orthonormalized continuous basis, 
(ezlez) = 8(z — z’). [3.93] 
then 
f eee =]; [3.94] 


Equations 3.91 and 3.94 are the tidiest ways to express completeness. 


Problem 3.21 Show that projection operators are idempotent: P? = Ê. Determine 
the eigenvalues of P, and characterize its eigenvectors. 


Problem 3.22 Consider a three-dimensional vector space spanned by an orthonor- 
mal basis |1), |2), |3). Kets |o) and |8) are given by 


læ) = ill) — 212) — i3. |B) = i11) + 213). 


(a) Construct (œ| and (£| (in terms of the dual basis (1|, (2|, (3]). 
(b) Find (oa|8) and (|æ), and confirm that (8|o) = (a|B)*. 


(c) Find all nine matrix elements of the operator A = |æ) (l, in this basis, and 
construct the matrix A. Is it hermitian? 
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Problem 3.23 The Hamiltonian for a certain two-level system is 
H = e (|1)(1] — [2)(2] + 11)(21 + [2)(1)). 


where |1), |2) is an orthonormal basis and € is a number with the dimensions of 
energy. Find its eigenvalues and eigenvectors (as linear combinations of |1) and 
|2)). What is the matrix H representing H with respect to this basis? 


Problem 3.24 Let Ô be an operator with a complete set of orthonormal eigenvec- 
tors: 


Olen) = qnlen) = le 233382): 


Show that Q can be written in terms of its spectral decomposition: 


Ó = ` qn en) (en. 
n 


Hint: An operator is characterized by its action on all possible vectors, so what 
you must show is that 


Ole) = | aule» (esl p læ). 


for any vector |a). 


FURTHER PROBLEMS FOR CHAPTER 3 


Problem 3.25 Legendre polynomials. Use the Gram-Schmidt procedure (Prob- 
lem A.4) to orthonormalize the functions 1, x, x?, and x?, on the interval —1 « 
x < 1. You may recognize the results—they are (apart from the normalization)?? 
Legendre polynomials (Table 4.1). 


Problem 3.26 An anti-hermitian (or skew-hermitian) operator is equal to minus 
its hermitian conjugate: 

Q'-—-Q. [3.95] 

(a) Show that the expectation value of an anti-hermitian operator is imaginary. 


(b) Show that the commutator of two hermitian operators is anti-hermitian. How 
about the commutator of two anti-hermitian operators? 


301 egendre didn't know what the best convention would be: he picked the overall factor so that 
all his functions would go to | at x = 1. and we're stuck with his unfortunate choice. 
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Problem 3.27 Sequential measurements. An operator A, representing observ- 
able A, has two normalized eigenstates y; and y2, with eigenvalues a, and a, 
respectively. Operator B, representing observable B, has two normalized eigen- 
states $, and $», with eigenvalues b; and b». The eigenstates are related by 


Vi = 3601 +4¢2)/5, V = (461 — 362)/5. 


(a) Observable A is measured, and the value a; is obtained. What is the state of 
the system (immediately) after this measurement? 


(b) If B is now measured, what are the possible results, and what are their 
probabilities? 


(c) Right after the measurement of B, A is measured again. What is the proba- 
bility of getting a)? (Note that the answer would be quite different if I had 
told you the outcome of the B measurement.) 


x Problem 3.28 Find the momentum-space wave function &, (p. t) for the nth sta- 
tionary state of the infinite square well. Graph |®)(p. Dp? and IDo(p.t)I?, as 
functions of p (pay particular attention to the points p = tnzh/a). Use $, (p.t) 
to calculate the expectation value of p?. Compare your answer to Problem 2.4. 


Problem 3.29 Consider the wave function 


] 
V(x.0—1 J2nA 


0. otherwise, 


Carer 
e ni ecu enh. 


where n is some positive integer. This function is purely sinusoidal (with wavelength 
A) on the interval —nA < x < n2, but it still carries a range of momenta, because 
the oscillations do not continue out to infinity. Find the momentum space wave 
function ®¢(p. 0). Sketch the graphs of |Y (x, 0)|? and |®(p,0)|?, and determine 
their widths, wy and wp (the distance between zeros on either side of the main 
peak). Note what happens to each width as n — oo. Using wy and wy as estimates 
of Ax and Ap, check that the uncertainty principle is satisfied. Warning: If you try 
calculating op, you're in for a rude surprise. Can you diagnose the problem? 


Problem 3.30 Suppose 


A 
POSSET (—co «X « oo) 


for constants A and a. 


126 Chapter 3 Formalism 


(a) Determine A, by normalizing V(x, 0). 
(b) Find (x), (x2), and c, (at time t = 0). 


(c) Find the momentum space wave function (p. 0), and check that it is nor- 
malized. 


(d) Use (p. 0) to calculate (p), (p?). and c; (at time ¢ = 0). 


(e) Check the Heisenberg uncertainty principle for this state. 


xProblem 3.31 Virial theorem. Use Equation 3.71 to show that 


d dV 
Z xp) = 2T) - i) | [3.96] 


where T is the kinetic energy (H = T + V). In a stationary state the left side is 
zero (why?) so 


Ty = LE) [3.97] 
This is called the virial theorem. Use it to prove that (T) = (V) for stationary 


states of the harmonic oscillator, and check that this is consistent with the results 
you got in Problems 2.11 and 2.12. 


Problem 3.32. In an interesting version of the energy-time uncertainty principle?! 
At = t/7, where t is the time it takes V(x.f) to evolve into a state orthogo- 
nal to V(x. 0). Test this out, using a wave function that is an equal admixture 
of two (orthonormal) stationary states of some (arbitrary) potential: Y(x,0) = 


Q/A2)[VA (x) + Va ()]. 


x «Problem 3.33 Find the matrix elements (z|x |n') and (n|p|n’) in the (orthonormal) 
basis of stationary states for the harmonic oscillator (Equation 2.67). You already 
calculated the “diagonal” elements (n = n’) in Problem 2.12; use the same tech- 
nique for the general case. Construct the corresponding (infinite) matrices, X and P. 
Show that (1/2m)P? + (mo? /2)X? = H is diagonal, in this basis. Are its diagonal 
elements what you would expect? Partial answer: 


| h 
(n|x|n^) = Lm (Vi at nx Vb aci) : [3.98] 


3lSee Lev Vaidman, Am. J. Phys. 60. 182 (1992) for a proof. 
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Problem 3.34 A harmonic oscillator is in a state such that a measurement of the 
energy would yield either (1/2)/iw or (3/2)hw, with equal probability. What is the 
largest possible value of (p) in such a state? If it assumes this maximal value at 
time tf = 0, what is Y (x,t)? 


x x Problem 3.35 Coherent states of the harmonic oscillator. Among the stationary 
states of the harmonic oscillator (|n) = w,,(x), Equation 2.67) only n = 0 hits the 
uncertainty limit (0,0, = f/2); in general, oop = (2n + 1)/i/2, as you found 
in Problem 2.12. But certain linear combinations (known as coherent states) also 


minimize the uncertainty product. They are (as it turns out) eigenfunctions of the 


lowering operator:?? 


a. |a) = aa) 
(the eigenvalue œ can be any complex number). 


(a) Calculate (x), (x?), (p), (p?) in the state |œ). Hint: Use the technique in 
Example 2.5, and remember that a4 is the hermitian conjugate of a... Do not 
assume o is real. 


(b) Find c; and op; show that oxop = h/2. 


(c) Like any other wave function, a coherent state can be expanded in terms of 
energy eigenstates: 


la) = D Cy |n). 


n=0 
Show that the expansion coefficients are 
a” 
vn! 


(d) Determine co by normalizing |æ}. Answer: exp(—|a|?/2). 


Cn = co. 


(e) Now put in the time dependence: 
|n) — eE ny, 


and show that |a(r)) remains an eigenstate of a_, but the eigenvalue evolves 
in time: 

a(t) = e a. 
So a coherent state stays coherent, and continues to minimize the uncertainty 
product. 


2 à " . lu 
32There are no normalizable eigenfunctions of the raising operator. 
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(f) Is the ground state (|n = 0)) itself a coherent state? If so, what is the eigen- 
value? 


Problem 3.36 Extended uncertainty principle.” The generalized uncertainty 
principle (Equation 3.62) states that 


— 
t2 


(C)*, 


2 2 

2 Quom 
9498 24 
where C = —i[A, B]. 

(a) Show that it can be strengthened to read 


2 


2 l 2 
a406 2 4 (CY + (D)), [3.99] 


where D = Â Ê + BÁ—2(A) (B). Hint: Keep the Re(z) term in Equation 3.60. 


(b) Check Equation 3.99 for the case B — A (the standard uncertainty principle 
is trivial, in this case, since C — 0; unfortunately, the extended uncertainty 
principle doesn't help much either). 


Problem 3.37 The Hamiltonian for a certain three-level system is represented by 


the matrix 
‘a 0 b 
H=10 c O0], 
b 0 a 


where a, b, and c are real numbers (assume a — c 4 + b). 


(a) If the system starts out in the state 


0 
140) = | 1]. 
0 
what is |4(1))? 
(b) If the system starts out in the state 
0 
|8(0)) = | 0 


what is |4(t))? 


33 For interesting commentary and references, see R. R. Puri, Phys. Rev. A 49, 2178 (1994). 
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Problem 3.38 The Hamiltonian for a certain three-level system is represented by 
the matrix 


100 
H=fhw|0 2 0 
0 0 2 


Two other observables, A and B, are represented by the matrices 


0 1 0 2 0 0 
A=,A}1 0 0], B-4|[0 O IJ, 
0 0 2 0 1 0 


where c, A, and u are positive real numbers. 
(a) Find the eigenvalues and (normalized) eigenvectors of H, A, and B. 


(b) Suppose the system starts out in the generic state 


c] 
14(0) = | c2 
C3 


with |cil? + |c2|? + |c3|* = 1. Find the expectation values (at t = 0) of H, 
A, and B. 


(c) What is |4(t))? If you measured the energy of this state (at time t), what 
values might you get, and what is the probability of each? Answer the same 
questions for A and for B. 


x xProblem 3.39 
(a) For a function f(x) that can be expanded in a Taylor series, show that 
T p 
f G + xo) = APA F(x) 


(where xo is any constant distance). For this reason, p/f is called the genera- 
tor of translations in space. Note: The exponential of an operator is defined 
by the power series expansion: e? = 1+ Ô + (0/20? + (1/30 Q? 4 .... 


(b) If w(x,1) satisfies the (time-dependent) Schrödinger equation, show that 
V(x,t + to) = gi Ptolh iy cy. t) 


(where fo is any constant time); — /h is called the generator of translations 
in time. 


130 Chapter 3 Formalism 


(c) Show that the expectation value of a dynamical variable Q(x, p, t), at time 
t + to, can be written? 


(Qua, = (Wx. Dli OR. Bot + tye T P/N (x, 1). 


Use this to recover Equation 3.71. Hint: Let tọ = dt, and expand to first 
order in dt. 


x «Problem 3.40 


(a) Write down the time-dependent “Schrödinger equation" in momentum space, 
for a free particle, and solve it. Answer: exp(—ip?t /2mh) ®(p. 0). 


(b) Find ®(p.0) for the traveling gaussian wave packet (Problem 2.43), and 
construct (p, f) for this case. Also construct |®(p. r)|?, and note that it is 
independent of time. 


(c) Calculate (p) and (p?) by evaluating the appropriate integrals involving c, 
and compare your answers to Problem 2.43. 


(d) Show that (H) = (p)?/2m + (H)o (where the subscript 0 denotes the sta- 
tionary gaussian), and comment on this result. 


MIn particular. if we set t = 0. and drop the subscript on tọ. 
(Q(t) = (W(x, DI Qvo. 0) = (W(x. OT 7! QU wo 0)). 


where U = exp(—i Ât /ħ). This says that you can calculate expectation values of Q either by sand- 
wiching Q between V (x. 1)” and V(x.r). as we have always done (letting the wave functions carry 
the time dependence), or e/se by sandwiching U-'QU between Vr. 0)* and Y(x. 0). letting the 
operator carry the time dependence. The former is called the Schródinger picture, and the latter the 
Heisenberg picture. 


CHAPTER 4 


QUANTUM MECHANICS IN 
THREE DIMENSIONS 


4.1 SCHRODINGER EQUATION IN SPHERICAL COORDINATES 


The generalization to three dimensions is straightforward. Schródinger's equation 
Says 
ow 


ih— = HW; [4.1] 
or 


the Hamiltonian operator! H is obtained from the classical energy 
Le a PLE T 44 pty hy 
2 2m ICT Eos 


by the standard prescription (applied now to y and z, as well as x): 


ho ħ ð ho 
Peer oat P1 ee usto [4.2] 


! Where confusion might otherwise occur I have been putting “hats” on operators. to distinguish 
them from the corresponding classical observables. I don't think there will be much occasion for 
ambiguity in this chapter. and the hats get to be cumbersome. so l am going to leave them off from 
now on. 
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or 


[4.3] 


for short. Thus 


[4.4] 


where 
v= a T PR] t 4.2 [4.5] 


is the Laplacian, in cartesian coordinates. 

The potential energy V and the wave function V are now functions of r — 
(x. v. z) and t. The probability of finding the particle in the infinitesimal volume 
d?r — dx dv dz is |W(r. Dp? d?r, and the normalization condition reads 


f ween- l. [4.6] 


with the integral taken over all space. If the potential is independent of time, there 
will be a complete set of stationary states, 


V, (r. f) = v, (r)e Ent [4.7] 


where the spatial wave function y, satisfies the time-independent Schrödinger 
equation: 


h? 


—— Vy E Vy = Ey. [4.8] 
2m 


The general solution to the (time-dependent) Schródinger equation is 
W(r.t) = »3 curs (rye iE, . [4.9] 


with the constants c, determined by the initial wave function, W(r,0), in the 
usual way. (If the potential admits continuum states, then the sum in Equation 4.9 
becomes an integral.) 
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*Problem 4.1 


(a) Work out all of the canonical commutation relations for components of the 
operators r and p: [x. y], Ly. py]. [x. px], [py. pz], and so on. Answer: 


(ri. pjl— —lpi.rj])z-ihég. [ri.rj] = [pi pj] — 0. [4.10] 
where the indices stand for x, y, or z, and ry =x, ry = y, and F; = z. 


(b) Confirm Ehrenfest’s theorem for 3-dimensions: 
d ] d 
— = — " d ari = —VV . 4, l 
4; — (ph and z (p) = ( ) [4.11] 


(Each of these, of course, stands for three equations—one for each compo- 
nent.) Hint: First check that Equation 3.71 is valid in three dimensions. 
(c) Formulate Heisenberg’s uncertainty principle in three dimensions. Answer: 


0x0p, = h/2, 0y0p, 7 h/2. 0:0», = h/2, [4.12] 


but there is no restriction on, say, 0405.,. 


4.1.1 Separation of Variables 


Typically, the potential is a function only of the distance from the origin. In that 
case it is natural to adopt spherical coordinates, (r.0, $) (see Figure 4.1). In 
spherical coordinates the Laplacian takes the form? 


1.39.7759 1 à ə ] 9 
whzdoct avg. S d ungue. Te. ur. wena 
rr ( x us 00 (sin x tro (552) RPM 


In spherical coordinates, then, the time-independent Schródinger equation reads 


h^[108 f ,8y 1 9 av l 0?y 
c EE LIE (gr ccs scr feme | hue 
2m E ar ( ar ) * Fisinó 86 (n x) Tum (55:)] 
Vy — Ey. [414] 


We begin by looking for solutions that are separable into products: 


y (r. 0. 9) = R(r)Y (0. $). [4.15] 


?In principle, this can be obtained by change of variables from the cartesian expression 
(Equation 4.5). However. there are much more efficient ways of getting it; see. for instance, M. Boas, 
Mathematical Methods in the Physical Sciences. 2nd ed.. (Wiley. New York. 1983). Chapter 10. 
Section 9. 
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FIGURE 4.1: Spherical coordinates: radius r, polar angle 0, and azimuthal angle ¢. 


Putting this into Equation 4.14, we have 


h? [Yd n). R 98 (sino) a R 9Y 
————— e aimed r“ — oy T aS ee 
2m | r2 dr dr r? sinó 30 30 r? sin? 6 09? 


+VRY = ERY. 


Dividing by RY and multiplying by —2mr?/h?: 


| ld ( 2) 2mr^ V 2l 
—— (f a iine En Pe 
a dr h? 


x l 1 8 sind ) " 1 3Y j 
TE —Ó — Sl — — — = 
Y |sin@ 30 30 sin? 0 89? 


The term in the first curly bracket depends only on r, whereas the remainder 
depends only on 0 and $; accordingly, each must be a constant. For reasons that 
will appear in due course,” I will write this “separation constant" in the form /(/+1): 


l d f -dR 2mr? 

Eres eed us V(r) — E] 2 l0 + 1): 4.16 

Rdr ( T» h? un ER ! 
1 1 8 oY 1 8?Y 
oe ere ee eee ecd. 4.17 
y lamb (sin e) xc a | inan med 


3Note that there is no loss of generality here—at this stage / could be any complex number. 
Later on we'll discover that / must in fact be an iueger. and it is in anticipation of that result that I 
express tlie separation constant in a way that looks peculiar now. 
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*Problem 4.2 Use separation of variables in cartesian coordinates to solve the 
infinite cubical well (or “particle in a box"): 


0, if x, y.z are all between 0 and a: 
oo, otherwise. 


Vez = | 


(a) Find the stationary states, and the corresponding energies. 


(b) Call the distinct energies Ej. E». E3, ... , in order of increasing energy. 
Find Ej. E». E3, E4. E5, and Eg. Determine their degeneracies (that is, the 
number of different states that share the same energy). Comment: In one 
dimension degenerate bound states do not occur (see Problem 2.45), but in 
three dimensions they are very common. 


(c) What is the degeneracy of Ej4, and why is this case interesting? 


4.1.2 The Angular Equation 


Equation 4.17 determines the dependence of y on 0 and $; multiplying by Y sin? 0, 
it becomes: 


8 9?Y 
sind (sino ) + 393 = —/(1 + 1) sin? 0Y. [4.18] 


You might recognize this equation — it occurs in the solution to Laplace's equation 
in classical electrodynamics. As always, we try separation of variables: 


Y (0, $) = © (0) (ġ). [4.19] 
Plugging this in, and dividing by ©Ẹ, we find: 
1f.,d/. dO l 1 d?o 
la sino (sine >) tK + Isin? 0] $ doi —0 
The first term is a function only of 0, and the second is a function only of $, so 
each must be a constant. This timet I'll call the separation constant m7: 


l|. d/. dO 9 2. 

S [sine (sino) FI +1) sin“ 0 = mî: [4.20] 
1 do 
2527 =m’. [4.21] 


3 Again. there is no loss of generality here, since at this stage x could be any complex number: 
in a moment. though, we will discover that » must in [act be an eger. Beware: The letter. is 
now doing double duty. as mass and as a separation constant. There is no graceful way to avoid this 
confusion, since both uses are standard. Some authors now switch to M or u for mass, but I hate to 
change notation in mid-stream, and I don't think confusion will arise. as long as you are aware of the 
problem. 
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The @ equation is easy: 


—=-m’ > d$($)- e", [4.22] 

de? 
[Actually, there are two solutions: exp(im$) and exp(—im4), but we'll cover the 
latter by allowing m to run negative. There could also be a constant factor in 
front, but we might as well absorb that into ©. Incidentally, in electrodynamics we 
would write the azimuthal function (®) in terms of sines and cosines, instead of 
exponentials, because electric potentials must be real. In quantum mechanics there 
is no such constraint, and the exponentials are a lot easier to work with.] Now, 
when @ advances by 277, we return to the same point in space (see Figure 4.1), so 
it is natural to require that? 


Dlo 4-2x) = (9). [4.23] 


In other words, exp[im(@ + 27)] = exp(im), or exp(2zrim) = 1. From this it 
follows that 7 must be an integer: 


m —0, t1, £2,.... [4.24] 
The 0 equation, 
sin 0 a d + [I + 1) sin? @ — m?]O = 0 [4.25] 
16 do : 0 = UV. * 


is not so simple. The solution is 
©(0) = A P" (cos), [4.26] 


where P/" is the associated Legendre function, defined by$ 


l| 
PP (x) = (1 — x?) ( =) Pi(x). [4.27] 


and Pj(x) is the /th Legendre polynomial, defined by the Rodrigues formula: 


1 E a 
PO = T (=) (x^ — D. [4.28] 


dx 


"This is more slippery than it looks. After all. the probability density (||) is single-valued 
regardless of m. In Section 4.3 we'll obtain the condition on w by an entirely different—and more 
compelling —argument. 


Notice that Pj" = PP. Some authors adopt a different sign convention for negative values 
of m: sce Boas (footnote 2). p. 505. 
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TABLE 4.1: The first few Legendre polynomials, Pj(x): (a) functional form, 
(b) graphs. 


(35x? - 30x? + 3) 


EXE 
8 
P, = 16333 — 70x? + 15x) 


For example, 
l d 
P(x) = 1. Pi(x)= 2 —)-2x. 


Dod o T NE 
FAC) = 9 als ) (x^ — 1) = 5 Gx — ]). 


and so on. The first few Legendre polynomials are listed in Table 4.1. As the name 


suggests, P(x) is a polynomial (of degree /) in x, and is even or odd according to 
the parity of /. 2B P/" (x) is not, in general, a polynomial—if m is odd it carries 


a factor of /1 — x2: 
l ] 
P$) = 53x" -). Blo)j2u-x5^2-. "iE (3x? — D| —3xV1- x2, 


P?) = (1 — x?) (=) he = J =3(1 - x°), 
2 dx 2 


etc. (On the other hand, what we need is P;"(cos@), and v1 — cos? 6 = sin, so 
Pi" (cos 0) is always a polynomial in cone! multiplied—if m is odd—by sin. 
Some associated Legendre functions of cos are listed in Table 4.2.) 
Notice that / must be a nonnegative integer, for the Rodrigues formula to 
make any sense; moreover, if |m| > /, then Equation 4.27 says P;" = 0. For any 
given /, then, there are (2/ + 1) possible values of m: 


1-0, 1. 2,...; m--l.—lt1. .... —1.0. 1, ....1—1. 1. [429] 


But wait! Equation 4.25 is a second-order differential equation: It should have two 
linearly independent solutions, for any old values of / and m. Where are all the 
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TABLE 4.2: Some associated Legendre functions, P/" (cos @): (a) functional form, 
(b) graphs of r = P/"(cos@) (in these plots r tells you the magnitude of the function in 
the direction 0; each figure should be rotated about the z-axis). 


ies = sco 6- b Pic) PHO) 

Pl — sin8 3 = 15 sin 6(1 — cos? 6) f : 

P?) = cos 0 z= 15sin?6 cos 0 PAA) pi 

P3-3sin'8 {= isin8( cos” 8- 1) : - 

Pi=3sin@ cos@ P= 6 cos? 0 — 3 cos 0) 1#) eo PHO) 
(a) (b) 


other solutions? Answer: They exist, of course, as mathematical solutions to the 
equation, but they are physically unacceptable. because they blow up at 0 = 0 
and/or 6 = z (see Problem 4.4). 

Now, the volume element in spherical coordinates’ is 


dr — r^ sinb dr dé dọ. [4.30] 
so the normalization condition (Equation 4.6) becomes 
J IY r^ sine dr de do = l |a? ar f |Y |^ sine dê dọ = 1. 


It is convenient to normalize R and Y separately: 


eo O MN T 5 
Í |R|^r^ dr =i and | Í |Y |* sin0 d0 d$ = 1. [4.31] 
0 0 0 


7Sce. Tor instance, Boas (footnote 2). Chapter 5. Section 4. 
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TABLE 4.3: The first few spherical harmonics, Y” (0.6). 


12 UA. " 
) (3 cos?8 — 1) f sin? 0 cos e *7#@ 


5\2 "m U3 . 3 5 aid 
sin 0 cos ĝe” P-*l sir EE 
j 7 


8 


The normalized angular wave functions? are called spherical harmonics: 


(21 4- 1) (0 — |m|)! 
4x — (l4 |m]|)! 


Y?" (0, $) =€ 


e"? p? (cos 9). [4.32] 


where e = (—1)" for m > 0 and € = 1 for m < 0. As we shall prove later on, 
they are automatically orthogonal, so 


2m prn 
[ [ere one. oy sina dodo phu. 14331 
0 0 


In Table 4.3 I have listed the first few spherical harmonics. For historical reasons, / 
is called the azimuthal quantum number, and m the magnetic quantum number. 


*Problem 4.3 Use Equations 4.27, 4.28, and 4.32, to construct i and yj. Check 
that they are normalized and orthogonal. 


Problem 4.4 Show that 
Q(0) = A In[tan(0/2)] 


8The normalization factor is derived in Problem 4.54: € (which is always | or —1) is chosen 
for consistency with the notation we will be using in the theory of angular momentum: it is reasonably 
standard, though some older books use other conventions. Notice that 


! 4846 = (-1)"” (y")* ] 
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satisfies the @ equation (Equation 4.25), for / = m = 0. This is the unacceptable 
"second solution" — what's wrong with it? 


xProblem 4.5 Use Equation 4.32 to construct Yl (8, @) and Y2(8. Q). (You can take 
P? from Table 4.2, but you'll have to work out pl from Equations 4.27 and 4.28.) 
Check that they satisfy the angular equation A 4.18), for the appropriate 
values of / and m. 


x «Problem 4.6 Starting from the Rodrigues formula, derive the orthonormality con- 
dition for Legendre polynomials: 


l 2 
I P(x) Py (x) di = (zi) by". [4.34] 


Hint: Use integration by parts. 


4.1.3 The Radial Equation 


Notice that the angular part of the wave function, Y (0. @), is the same for all spher- 
ically symmetric potentials; the actual shape of the potential, V(r), affects only 
the radial part of the wave function, R(r), which is determined by Equation 4.16: 


d adR amr B 
dr ( d )-* [V(r) —EJR=/0 + 1)R. [4.35] 


This equation simplifies if we change variables: Let 
u(r) =rR(r), [4.36] 
so that R = u/r, dR/dr = [r(du/dr) — uMr?, (d/dr)[r2(dR/dr)] = rd?u/dr?, 


and hence 
h? du h* 1d +1 
artus 3 ) u = Eu. [4.37] 


2m dr? 


This is called the radial equation;? it is identical in form to the one-dimensional 
Schrédinger equation (Equation 2.5), except that the effective potential, 


h? 1d +1 
ave, 
2m r? 


[4.38] 


9 
Those m°s are masses, of course—the separation constant m does not appear in the radial 
equation. 
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contains an extra piece, the so-called centrifugal term, (42/2m){/(/ + 1)/r?]. It 
tends to throw the particle outward (away from the origin), just like the centrifu- 
gal (pseudo-)force in classical mechanics. Meanwhile, the normalization condition 
(Equation 4.31) becomes 


T 
f lu? dr — 1. [4.39] 
0 


That's as far as we can go until a specific potential V(r) is provided. 


Example 4.1 Consider the infinite spherical well, 


K ifr <a: 
veyed” Trga [4.40] 
oo. ifr>a. 


Find the wave functions and the allowed energies. 


Solution: Outside the well, the wave function is zero; inside the well, the radial 


equation says 
d?u Id 4- 1) 2 
ape = is —k uy [4.41] 


where 
V2mE 


k 
fi 


: [4.42] 


as usual. Our problem is to solve this equation, subject to the boundary condition 
u(a) = 0. The case / = 0 is easy: 


E) > : 
m = —k^u => u(r) = Asin(kr) + B cos(kr). 
r2 


But remember, the actual radial wave function is R(r) = u(r)/r, and [cos(kr)]/r 
blows up as r — 0. So!? we must choose B = 0. The boundary condition then 
requires sin(ka) = 0, and hence ka = nz, for some integer n. The allowed energies 
are evidently 
2 9232 
nn he 


Eno = (n = 1.2.3,...), [4.43] 


2ma? ` 


10 Actually. all we require is that the wave function be normalizable. not that it be finite: R(r) ~ 
l/r at the origin is normalizable (because of the r? in Equation 4.31). For a more compelling proof 
that B = 0. sec R. Shankar. Principles of Quantum Mechanics (Plenum. New York. 1980). p. 351. 
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the same as for the one-dimensional infinite square well (Equation 2.27). Normal- 
izing u(r) yields A = /2/a; tacking on the angular part (trivial, in this instance, 
since Y9(0. 6) = 1/4/47), we conclude that 


1 sin(mrr/a) 


0 E ———— 
Vn 0 xd " 


[Notice that the stationary states are labeled by three quantum numbers, n, /, and 
m: Wnim(r, 0, à). The energy, however, depends only on n and l: En] 

The general solution to Equation 4.41 (for an arbitrary integer /) is not so 
familiar: 


[4.44] 


u(r) = Arji(kr) + Brni(kr). [4.45] 


where jj(x) is the spherical Bessel function of order l, and r;(x) is the spherical 
Neumann function of order /. They are defined as follows: 


lxx. l N 
jx) = (=x)! (<=) = ni(x) = —(—x)! G5) COSA [4.46] 


xdx xdx x 


For example, 


. sin x COS X 
Jo(x) = mee no(x) = — f 


. | d fsinx sinx  cosx 
e - cot ( )- ES 


x dx X xz X 
bee 1 d M sinx ,f(ld ) xcosx — sinx 
X) = (—x) | -— —— = y^ | —— | — 
A x dx x xdx x? 


. PD 
3sinx —3xcosx — x^ sinx 


x? 


and so on. The first few spherical Bessel and Neumann functions are listed in 


Table 4.4. For small x (where sinx = x — x?/3! + x?/5! — ... and cosy = 
] —x?/2 + x*/4! —---), 
(OMe mee + 1082: due 
x) ~l; X) ——i xX) =: X) & —i 
JO > Ao P Jt 3 J2 15 


etc. Notice that Bessel functions are finite at the origin, but Neumann functions 
blow up at the origin. Accordingly, we must have B; = 0, and hence 


R(r) = Aji(kr). [4.47] 
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TABLE 4.4: The first few spherical Bessel and Neumann functions, 
Jn (X) and nj (x); asymptotic forms for small x. 


sin x 


i= 


cosx sinx 
n=- — 


X 


LAC. 3 l Soa 
y J SX — -z cosx Ny =- —|cosx— —5 sinx 
X, x* E I XT 


j 2! d : for x << J 
47 0m: pee . 


There remains the boundary condition, R(a) = 0. Evidently k must be chosen 
such that 
ji(ka) = 0; [4.48] 


that is, (ka) is a zero of the /th-order spherical Bessel function. Now, the Bessel 
functions are oscillatory (see Figure 4.2); each one has an infinite number of zeros. 


-.3 


FIGURE 4.2: Graphs of the first four spherical Bessel functions. 
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But (unfortunately for us) they are not located at nice sensible points (such as r. 
or nz, or something): they have to be computed numerically.!! At any rate, the 
boundary condition requires that 


l 
k = ~ By. [4.49] 
a 


where f,; is the nth zero of the /th spherical Bessel function. The allowed energies, 
then, are given by 


-Ë og 4.50 
nl — Sg? Pu [ . ] 

and the wave functions are 
Vus (F. 0. 0) = Anı Ji Byir/a) Yj" (0. Q$). [4.51] 


with the constant A, to be determined by normalization. Each energy level is 
(21 + 1)-fold degenerate, since there are (2/ + 1) different values of m for each 
value of / (see Equation 4.29). 


Problem 4.7 


(a) From the definition (Equation 4.46), construct 5; (x) and n2(x). 


(b) Expand the sines and cosines to obtain approximate formulas for n(x) and 
na(x), valid when x < 1. Confirm that they blow up at the origin. 


Problem 4.8 


(a) Check that Ari (kr) satisfies the radial equation with V(r) — 0 and / = 1. 


(b) Determine graphically the allowed energies for the infinite spherical well, 
when / = 1. Show that for large n, Ey; ~ (h?z?/2ma?)(n + 1/2)". Hint: 
First show that ji (x) = 0 — x = tan x. Plot x and tan x on the same graph, 
and locate the points of intersection. 


!! Abramowitz and Stegun. eds.. Handbook of Mathematical Functions, (Dover. New York. 1965). 
Chapter 10. provides an extensive listing. 
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x «Problem 4.9 A particle of mass m is placed in a finite spherical well: 


— V i : 
V(r) = phe 
0. if r a. 


Find the ground state, by solving the radial equation with / = 0. Show that there 
is no bound state if Vou? < zx?h?/8m. 


4.2 THE HYDROGEN ATOM 


The hydrogen atom consists of a heavy, essentially motionless proton (we may as 
well put it at the origin), of charge e, together with a much lighter electron (charge 
—e) that orbits around it, bound by the mutual attraction of opposite charges (see 
Figure 4.3). From Coulomb's law, the potential energy (in SI units) is 


2] 


e 
V(r)2-— 


-. 4.52 
4T eo r 


and the radial equation (Equation 4.37) says 


h? d?u e 1 P+) 
ER E E N E c 4.53 
2m dr? v | 4regr S 2m r? i " [4.33] 


Our problem is to solve this equation for u(r), and determine the allowed energies, 
E. The hydrogen atom is such an important case that I’m not going to hand you the 
solutions this time— we'll work them out in detail, by the method we used in the 
analytical solution to the harmonic oscillator. (If any step in this process is unclear, 
you may wish to refer back to Section 2.3.2 for a more complete explanation.) 


-e 
(electron) 


+e 
(proton) 


FIGURE 4.3: The hydrogen atom. 
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Incidentally, the Coulomb potential (Equation 4.52) admits continuum states (with 
E > 0), describing electron-proton scattering, as well as discrete bound states, 
representing the hydrogen atom, but we shall confine our attention to the latter. 


4.2.1 The Radial Wave Function 


Our first task is to tidy up the notation. Let 


/—2mE 


[4.54] 


Il 


K 


(For bound states, E is negative, so « is real.) Dividing Equation 4.53 by E, we 


have 5 ) 
1 d-u me’ 1 I(l + P 
—— = me —— |u 
K? dr? 2zegh?k (kr) (Kr)? 


This suggests that we introduce 


=kxr, and Z a 4.55 

p l po 2zeoh?k | 
so that Pp ja ; 
^u po ( 4 1 

— = | | — — + —— |u. 4.56 

dp? | p p IL pem 


Next we examine the asymptotic form of the solutions. As p — oo, the 
constant term in the brackets dominates, so (approximately) 


d?^u 
dp? = M. 
The general solution is 
u(p) = Ae ? + Be’. [4.57] 
but e? blows up (as p — co), so B = 0. Evidently, 


u(p) ~ Ae ^. [4.58] 


for large p. On the other hand, as p — 0 the centrifugal term dominates;!? approx- 
imately, then: 
du  d(l41) 
= —— NN. 


do o 


I? This argument does not apply when / = 0 (although the conclusion, Equation 4.59, is in fact 
valid for that case too). But never mind: All 1 am trying to do is provide some motivation for a change 
of variables (Equation 4.60). 
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The general solution (check it!) is 
u(o) = Cp't! + Dp". 
but p blows up (as p — 0), so D = 0. Thus 
u(p) ~ Cpl *!. [4.59] 


for small p. 
The next step is to peel off the asymptotic behavior, introducing the new 
function v(p): 
u(p) = p'*le^?v(p). [4.60] 
in the hope that v(o) will turn out to be simpler than u(o). The first indications 
are not auspicious: 


du ep dv 
— —pe (7¢+1—p)v+p—}. 
do do 


and 


d^u E I+ d dv d*v 
—z = ple? sof) acque pue pe ee : 
dp? p dp dp* 


In terms of v(p), then, the radial equation (Equation 4.56) reads 


d?^v 
dp? 


d 
p)— + [oo — 2( + Jv = 0. [4.61] 
dp 


Finally, we assume the solution, v(p), can be expressed as a power series 
in p: 


v(p) = 5 cjp!. [4.62] 


Our problem is to determine the coefficients (co. c1. c2. ...). Differentiating term 
by term: 


dv ia io pl 
— =) jejp/ Siting! 
j= j=0 


[In the second summation I have renamed the “dummy index": j — j+ 1. If this 
troubles you, write out the first few terms explicitly, and check it. You may object 
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that the sum should now begin at j = —1, but the factor (j + 1) kills that term 
anyway, so we might as well start at zero.] Differentiating again, 


d?v = i4 

a=) JU-FDemie 
2 

dp pr 


Inserting these into Equation 4.61, we have 


oo fere 
SAG Dejapl 20 4 0 Y 1G + Deje! 


oo ec 
=2 NEST t [oo — 20 + 1)] $ exp! = 0. 
j=0 j=0 


Equating the coefficients of like powers yields 
FG + Deja +20 4+ DG + Deja — 2jej + [oo — 20+ Die; = 0. 


or: 
2(j +/+ 1) — po 
is -loc bg zi» St pu 
This recursion formula determines the coefficients, and hence the function v(p): 
We start with co (this becomes an overall constant, to be fixed eventually by 
normalization), and Equation 4.63 gives us cı; putting this back in, we obtain c», 
and so on.!3 
Now let's see what the coefficients look iike for large j (this corresponds to 
large p, where the higher powers dominate). In this regime the recursion formula 
says!4 
RE^ ee A 
SECTIO D ge 


I5 You might wonder why 1 didn't use the series method directly on «(o)—why factor out the 
asymptotic behavior before applying this procedure? Well, the reason for peeling off p/*! is largely 
aesthetic: Without this, the sequence would begin with a long string of zeros (the first nonzero coefficient 
being cj.) ): by factoring out p/*^! we obtain a series that starts out with 9. The e^? factor is more 
critical —if you don't pull that out, you get a three-term recursion formula. involving cj42. cj 44. and 
cj ry it!) and that is enormously more difficult to work with. 


'4You might ask why I don't drop the | in j + 1— after all, I am ignoring 2(/ + 1) — pp in the 
numerator. and 2/ 4- 2 in the denominator. In this approximation it would be fine to drop the 1 as well. 
but keeping it makes the argument a little cleaner. Try doing it without the L. and you'll see what I 
mean. 
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Suppose for a moment that this were exact. Then 


2] 
Cj = 7 e0 [4.64] 
; j! 
so PEON 
24 4 
v(p) — co D ue = coe”, 
jo ^' 
and hence 
u(p) = cop! *le^, [4.65] 


which blows up at large p. The positive exponential is precisely the asymptotic 
behavior we didn't want, in Equation 4.57. (It's no accident that it reappears here; 
after all, it does represent the asymptotic form of some solutions to the radial 
equation—they just don't happen to be the ones we're interested in, because they 
aren't normalizable.) There is only one way out of this dilemma: The series must 
terminate. There must occur some maximal integer, jmax, such that 


Cl jax tl) = 0. [4.66] 
(and beyond which all coefficients vanish automatically). Evidently (Equation 4.63) 
2(jmax +/+ 1) — po = 0. 


Defining 
n= Jmax +! +1 [4.67] 


(the so-called principal quantum number), we have 
po = 2n. [4.68] 
But po determines E (Equations 4.54 and 4.55): 


2.) | 
h^k* me? 


[4.69] 


EDO CREE UIN EU TUN. 
2m 8z teh" pg 


so the allowed energies are 


? 2 E 

m e i 
Epl E En 4.70 
" m (zx) ^ y [4.70] 
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This is the famous Bohr formula-—by any measure the most important result in. 
all of quantum mechanics. Bohr obtained it in 1913 by a serendipitous mixture 


of inapplicable classical physics and premature quantum theory (the Schrédinger 


equation did not come until 1924). 


Combining Equations 4.55 and 4.68, we find that 


| me X1] 1 
(Artus [4.71] 
A4meoh^/ n an 
where 
E a — 0.529 x 10-18 m [4.72] 
me 


is the so-called Bohr radius.> It follows (again, from Equation 4.55) that 


nus. [4.73] 
an 


The spatial wave functions for hydrogen ate labeled by three quantum numbers (, 
i, and m): 
Vnim Gr, 0, Ø) = Rar) Yr (06, $), [4.74] 


where (referring back to Equations 4.36 and 4.60) 
. l ' . l 
Rar) = —p ^ e ^v(p), [4.75] 
and v(p) is a polynomial of degree jmax — n —1 — 1 in p, whose coefficients are 
determined (up to an overall normalization factor) by the recursion formula 


2(j 4L L— m) 


p [4.76 
G+DG+U+427 [4.76] 


Cj = 


The ground state (that is, the state of lowest energy) is the case n = 1; putting 
in the accepted values for the physical constants, we get: 


joy ad 
Ell — 13.6 eV. 4.77 
| E: (=) | : Feud 


ISYt is traditional to write. the Bohr radius with a subscript: ag. But this is cumbersome and 
untiecessary, so. J prefer to leave the subscript. off. 


e em eet RC asi i cd 
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Evidently the binding energy of hydrogen (the amount of energy you would have. 
to impart to the electron in the ground state in order to ionize the atom) is. 13.6 eV. 
Equation 4.67 forces } =.0, whence also m = 0 (see. Equation 4.29), so 


Vioo(r, 0, 9) = Rio(r) YG @, $). [4.78] 


The recursion formula truncates after the first term (Equation 4.76 with j = 0 
yields cy = 0), so v(o) is a constant (co), and 


Rio(r) = » pu [4.79] 


Normalizing it, in accordance with Equation 4.31: 


foe " co 2 oO, 5a 
| [Rio|?r? dr = | ] f e 7/9 t? dr = eg? =1, 
JO a^ Jo 4 


so co = 2/ ya. Meanwhile, Y? = 1/4/47, and hence the ground state of hydrogen 
is 


Viol, 0, p) = tao ; [4.80] 


If n = 2 the energy is 


—13.6 eV 
Ei —— = —34 eV: [4.81] 
this is the first excited state—or rather, states, since we can have either / = 0 (in 
which case m = 0) or / = 1 (with m = —1, 0, or +1); evidently four different 


states share this same energy. If / = 0, the recursion relation (Equation 4.76) gives 
cj = —co (using j — 0), and co — 0 (using j = 1), 
so v(p) = c(1 — p), and therefore 
' co FC —r/2a 

Ry) = == (1-2) e^. 4.82 

Rao(r) aN ae n [4.82] 
[Notice that the. expansion coefficients {cj} are completely different for different 
quantum numbers # and /.] If / = 1 the recursion formula terminates the series 


after a single term; v(p) is a constant, and we find 


Rar) = 


O eta [4.83] 
4a2 


(In each case the constant co is to be determined by normalization—see Prob- 
lem 4.11.) 
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For arbitrary n, the possible values of / (consistent with Equation 4.67) are 
?=0,1,2,...,4a—-—1, [4.84] 


and for each / there are (21 + 1) possible values of m (Equation 4.29), so the total 
degeneracy of the energy level E; is 


n—l 
dq) = 5 Qr) =n’. [4.85] 
1-0 


The polynomial v(o) (defined by the recursion formula, Equation 4.76) is a function 
well known to applied mathematicians; apart from normalization, it can be written 
as 

v(p) = L2:*! (2p), [4.86] 


where 


p Ru adiu 
LP? a) 2 (7-1)? =) Lx) [4.87] 


E105 a, 
(Y) = ot : e yu 
Lg (x) =e - (e x ) [4.88] 


is the qth Laguerre polynomial.! (The first few Laguerre polynomials are listed 
in Table 4.5; some associated Laguerre polynomials are given in Table 4.6. The 
first few radial wave functions are listed in Table 4.7, and. plotted in Figure 4.4.) 
The normalized liydrogen wave functions are!? 


TAS 2M (-1- n0! ,—rjna ( 2F / 2L joniga yg A 
E - / (s) arna Gs) E Grime] Yeo. 


They are not pretty, but don’t complain—this is one of the very few realistic 
systems that can be solved at all, in exact closed form. Notice that whereas the 
wave functions depend on all three quantum numbers, the energies (Equation 4.70) 
are determined by 7. alone. This is a peculiarity of the Coulomb potential; in the 


[4.89] 


16 Ag usual, there. are rival normalization conventions in the literature; 1 have adopted the most 
nearly. standard. one. 


Er you want to sce how the normalization factor is calculated, study (for example), L. Schiff, 
Quantum Mechanics, 2nd-ed., (McGraw-Hill, New York, 1968), page 93. 
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TABLE 4.5: The first few Laguerre polynomials, La(x). 


= E 
Lj-—x-«1 

In= x?-4x42 

I4-2—) + 9x? — 18x + 6 

L,-x* — 16x? + 72x? — 96x +24 

Ls = i + 25x4 — 20053 + 600x? — 600x + 120 

Lg 2 36 — 3632 + 450x* — 2400x3 + 5400: — 4320x + 720 


TABLE 4.6: Some associated Laguerre 
polynomials, f p (x). 


LEE P 
L5-i Lj=2 


L9=—x+1] Li- -6x+ 18 


LP =x? -4x42 L2 = 1232 — 96x + 144 


Lisi Là-6 


Li =-2x+4 Li- -24x +96. 
L} = 3x? — 18x + 18 L3 = 60x? — 600x + 1200 


case of the spherical well, you may recall, the energies depend also on J (Equa- 
tion 4.50). The wave functions are mutually orthogonal: 


J Vm Wallin’ r? sin 0 dr do dọ = Onn! On’ Ômm' - [4.90] 


This follows from the orthogonality of the. spherical harmonics (Equation 4.33) 
and (for m # n’) from the fact that they are eigenfunctions of H with distinct 
eigenvalues, 

Visualizing the hydrogen wave functions is not easy. Chemists like to draw “den- 
sity plots,” in which the brightness of the cloud is proportional to |y i4 (Figure 4.5). 
More quantitativé (but perhaps hàrder to read) are surfaces of constant probability 
density (Figure 4.6). 


xProblem 4.10 Work out the radial wave functions. R30, R31, and R32, using the 
recursion formula (Equation 4.76). Don't bother to normalize them. 
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TABLE 4.7: The first few radial wave functions for hydrogen, 
Rit). 


R19 =2a 3” exp(-r/a) 


s ( " i Z ) exp (-r/2a) 


=l git Z exp (=r/2a) 
24 i 


= (£y) exp (-7/3a) 


E ) exp (—7/3a) 


i encre 


Ry = a zu -1 3 2pm zy) Z exp (—r/4a) 


42 a per ae y 
Ras = a 1 ex /4a 
42 645 1 2 i p( r/4a) 


Ras —l a3? yos Ed 
Rao er (£)'exp Coto) 


xProblem 4.11 
(a) Normalize R»o (Equation 4.82), and construct the fonction W200: 


(b) Normalize R2, (Equation 4.83), and construct 3/211, V/210, and 3/211. 


xProblem 4.12 
(a) Using Equation 4.88, work out the first four Laguerre polynomials. 
(b) Using Equations 4.86, 4.87, and 4.88, find v(o), for the case n = 5, 1 = 2. 


(c) Find v(o) again (for the case n = 5, I = 2), but this time get it from the 
recursion formula (Equation 4.76). 
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FIGURE 4.4: Graphs of the first few hydrogen radial wave functions, R,,)(7). 


*Problem 4.13 


(a) Find (r) and (r?) for an electron in the ground state of hydrogen. Express 
your answers in terms of the Bohr radius. 

(b) Find (x) and (x?) for an electron in the ground state of hydrogen. Hint: This 
requires no new integration— note that r? = x? y? +z, and exploit the 
symmetry of the ground state. 

(c) Find (x2) in the state n = 2, L = 1, m = 1. Warning: This state is not 
symmetrical in x, y, ze Use x = r sino cos @. 
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(4,1,0) (4,2,0) (43,0) 


FIGURE 4.5: Density plots for the hydrogen wave functions (7, I, m), Imagine each 
plot to be rotated about the (vertical) z axis. Printed by permission using *Atom in.a 
Box," v1.0.8, by Dauger Research. You can make your own plots by going to the Web 
site http:// dauger. com. 


Problem 4.14 What is the most probable value of r, in the ground state of hydro- 
gen? (The answer is not zero!) Hint: First you must figure out the probability that 
the electron would be found between r and r + dr. 


Problem 4.15 A hydrogen atom starts out in the following linear combination of 
the stationary states n = 2, 1 = 1, m.= 1 and n 22, I = 1, m = —1: 


2 | NE 
Wr, 0) = — (p21 1 + V21-1). 
Bee 
(a) Construct W(r, t). Simplify it as much as you can. 


(b) Find the expectation value of the potential energy, (V). (Does it depend on 
t?) Give. both the formula and the actual number, in electron volts. 
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and moving down.!® In practice such perturbations are always present; transitions 
(or, as they are sometimes called, "quantum jumps”) are constantly occurring, and 
the result is that a container of hydrogen gives off light (photons), whose energy 
corresponds to the difference in energy between the initial and final states: 


fi d 
E; = E; — Ey =—13.6 €V| 5 —; [4:91] 
ne Ny 


Now, according to the Planck formula,!? the energy of a photon is propor- 
tional to its frequency: 
Ey — liv. [4.92] 


Meanwhile, the wavelength is given by A — c/v, so 


1 l l 1 l 
n E 2: [4.93] 
where 
sc Ye 
R = — (=) = 1.097 x 10^ m^! [4.94] 
Anch? Mireo 


is known as the Rydberg constant. Equation 4.93 is the Rydberg formula for the 
spectrum of hydrogen; it was discovered empirically in the nineteenth century, and 
the greatest triumph of Bohr’s theory was its ability to account for this result—and 
to calculate R in terms of the fundamental constants of nature. Transitions to the 
ground state (ng = 1) lie in the ultraviolet; they are known to spectroscopists as 
the Lyman series. Transitions to the first excited state (ny = 2) fall in the visible 
region; they constitute the Balmer series. Transitions to n; = 3. (the Paschen 
series) are in the infrared; and so on (see. Figure 4.7). (At room temperature, most 
hydrogen atoms are in the ground state; to obtain the emission spectrum you. must 
first populate the various excited states; typically this is done by passing an electric. 
spark through the gas.) 


«Problem 4.16 A hydrogenic atom consists of a single electron orbiting a nucleus 
with Z protons (Z = I would be. hydrogen itself, Z = 2 is ionized helium, Z = 3 


ISBy its nature, this itivolves a time-dependent interaction, and the details will have to wait for 
Chapter 9; for our present purposes the actual mechanism involved is: immaterial. 

I? The photon is a quantum of ¢léctromagnetic radiation; it’s a relativistic object.if there ever was 
one, and therefore outside the scope of nonrelativistic quanturn mechanics. It will be useful in a. few 
places to:speak of photons, and to invoke the Planck formula for their cnergy, but please bear in mind 
that this is external to the. thcory we are developing. 
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Energy (eV) 


-9.0 
—10.0 
-11.0 
-12.0 
—18.0 


—14.0 — Lyman series 


FIGURE 4.7: Enetgy levels and transitions in the spectrum of hydrogen. 


is doubly ionized lithium, and so on). Determine the Bohr energies E,(Z), the 
binding energy Et1(Z), the.Bohr radius a(Z), and the Rydberg constant. R(Z) for a 
hydrogenic atom. (Express your answers as appropriate multiples of the hydrogen 
values.) Where in the. electromagnetic spectrum would the Lyman series fall, for 
Z — 2.and Z — 3? Hint: There's nothing much to calculate here—in the potential 
(Equation 4.52) e? — Ze’, so all you have to do is make the same substitution in 
all the final results. 


Problem 4.17 Consider the earth-sun system às a gravitational analog to the hydro- 
gen atom. 


(a) What is the potential energy function (replacing Equation 4.52)? (Let m be 
the mass of the earth, and M the mass of the sun.) 


(b) What is the “Bohr radius,” ag, for this system? Work out the actual number. 
5 Ag Y 


(c) Write down the gravitational "Bohr formula," and, by equating E, to the 
classical energy of a planet in a circular orbit of radius fj, Show that n = 
V/ro/ag. From this, estimate the quantum number n of the earth. 
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(d) Suppose the earth made a transition. to the next lower level (n — 1). How 
much energy (in Joules) would be released? What would the wavelength of 
the emitted photon (or, more likely; graviton) be? (Express your answer in 
light years—is the remarkable answer?? a coincidence?) 


4.3 ANGULAR MOMENTUM 


As we.have seen, the stationary states of the hydrogen atom are labeled by three 
quantum numbers: n, J, and m. The principal quantum number (n) determines the 
energy of the state (Equation 4.70); as it turns out, 7 and m are related to the orbital 
angular momentum. In the classical theory of central forces, energy ánd angular 
momeritum are the fundamental conserved quantities, and it is not surprising that 
angular momentum plays a significant (in fact, even more important) role in the 
quantum theory. | 
Classically, the angular momentum of à particle (with respect to the origin) is 
given by tlie formula 
L=rxp, [4.95] 


which is to say, 
Ly = ypz—2py,  Ly-—zpx—xp; Lz = Xpy — YPx- [4.96] 


The corresponding quantum operators are obtained by the standard prescription 
ps > —ihd/ox, py > —ihd/dy, p; — —ihd/dz. In the following section we'll 
obtain the eigenvalues of the angular momentum operators. by a. purely algebraic. 
technique reminiscent of the oie we used in Chapter 2 to get the allowed energies 
of the harmonic oscillator; it is all based on the clever exploitation of commutation 
relations. After that we will turn to the more difficult problem ‘of determining the 
eigenfunctions. 


4.3.1 Eigenvalues 


The operators Ly and Ly, do not commute; in fact”! 


[Ly, Ly] = [ypz — zpy, ZPx — XPz] 
= [ypz, zPx1 — [ypc xpz] — [zpy, zPx] + [zpy, xp. — [4.97] 


20 Thanks to.John Meyer for pointing this out. 

2! Note: that all the operators we encounter in quantim mechanics (footnote 15, Chapter 1) are 
distributive with respect to addition: A(B + C) = AB + AC. In particular, [A, B + C] = [A. B] + 
[A, C]. 


Section 4.3: Angular Momentum | 161 


From the canonical commutation relations (Equation 4.10) we know that the only 
operators here that fail to commute are x with px, y with py, and.z with pz. So 
the two middle terms drop out, leaving. 


[Lx Ly] = ypx[pz, z] + xpy[zi Pe] = iR(xpy — ypx) = iñ Lz. [4.98] 
Of course, we could have started out with [L,, Lz] or [L;, L], but there is no need 


to calculate these separately —we can get them immediately by cyclic permutation 
of the indices (€ > y, y > z, z > X): 


ek= MED] SERES [Eo EDS ALS. [4.99] 


These are the fundamental commutation relations. for angular momeritum; évery- 
thing else follows from them. 

Notice that Ly, Ly, and Lz are incompatible observables. According to the 
generalized uncertainty principle. (Equation 3.62), 


2 bs \2 g : 
oj Op. Z (nia) E 19. 


or h 
(GLOL, = DKL). — [4.100] 


It would therefore be futile to look for states that are simultaneously eigenfunctions: 
of Ly and Ly. On the other hand, the square of the total angular momentum, 


L? = L? +L +E, [4.101] 
does commute with Ly: 


(27, L] = (£2, Lx] + [L3, Le] +122, Lx] 
= Ly[Ly, Ly] + [Ly, Ly] Ly. TL [Lz, Ly] + [Lz Ly\L, 
= LyC-ihL;) + C-ihL;)L, + Lz(ihLy) + (ihLy)Lz 


Esp: 


(I used Equation 3.64 to simplify the commutators; note also that any operator 
commutes with itself.) It follows, of course, that L? also commutes with Ly and 
Lz: | i 

[^ Es] es. ESI ESSO; [ES Eos dt [4.102] 
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or, more compactly, v 
[L*,L] = 0. [4.103] 


So L? is compatible with each component of L, and we can. hope to find simulta- 
neous eigenstates of L^ and (say) Lz: 


LFA and Lf = uf. [4.104] 


We'll use.a “ladder operator" technique, very similar to the one we- applied to 
the harmonic oscillator back in Section. 2.3.1. Let. 


E mL; tiLy, [4.105] 


The commutator with L; is 
[Lc 4) 9 [Lu Fe) iD Ly] SihRLy Sua) = T AG TEIL), 


SO . 
[Lz L4] 2 t HL 4. [4.106] 


And, of course, g 
[L?,L4]=0. [4.107] 


Iclaim that if f is an eigenfunction of L? and Lz, so also is £+ f: Equation 4.107 


Says E 
EES SLE f LIO exa, [4.108] 


so L+ f is an eigenfunction of Le with the same eigenvalue 4, and Equation 4.106 
says. 


L(L4f) = (L,L4—L+L,)f+L4b,f = tals f+ (uf) 
= (u E RA f), [4.109] 


so L+ f is an eigenfunction of L} with the new eigenvalue u th. We call Ly 
the "raising" operator, because it. increases the eigenvalue of L; by f, and L— the 
“lowering” operator, because it lowers the eigenvalue by fi. 

For à given value of A, then, we obtain a "ladder" of states, with each 
"rung" separated from its neighbors by one unit of ^ in the eigenvalue of L, (see 
Figure 4.8). To ascend the ladder we apply the raising operator, and.to descend, the 
lowering operator. But this process cannot go on forever: Eventually we're going 
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Í 
£ 
A 
: 


FIGURE 4.8: The “ladder” of angular momentum states. 


to-reach a state for which the z-component exceeds the total, and that cannot be.22 
There must exist a "top rung," fr, such that? 


Life =0. [4.110] 


22Formailly, (L?) = (L3) + (2) + (L2), but (£2) = (FIL F) = (Lx f Lx f) = 0 (and likewise 
for Ly), so A = (2) + (LS) + u^ > u^. 


23 Actually, all we.can conclude is that L+ fp is fot normalizuble—ils norm could be infinite, 
instead of zero..Problem 4.18 explores this alternative. 
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Let AI be the eigenvalue of L; at this top rung (the appropriateness of the letter 
"I" will appear in a moment): 


EPS "ES e [4.111] 

Now, 

Ly Ly = (Ly Etl (Lr rib S L2 +L? x i(LSLy — LyLx) 
Sud ENDS. 

or, putting it the other way around, 

L? = L} Lẹ + Li FAL [4.112] 
It follows that 
Lf, = (LLa +L Fal fi = (04 RP HRI f; — PIG Dfe, 


and hence S 
A — RÀ 1). [4.113] 


This tells us the eigenvalue of L? in terms of the máximum eigenvalue of Lz. 
Meanwhile, there is also (for the same reason) à bottom rung, fp, such that. 


L fy —0. [4.114] 
Let Al be the eigenvalue of L; at this bottom rung: 
Lefo =l fo, L? fy = Mb. [4.115] 
Using Equation. 4.112, we have 
L? f, = (LiL + L hb) fo = (4 RT — RD fo = RIT — 1) fo, 


and therefore : 
X —nu-1). [4.116] 


Comparing Equations 4.113 and 4.116, we see that ZŒ + 1) = I(I — 1), so either 
1-141 (which is absurd—the bottom rung would be higher than the top rung!) 
or else 

EA [4.117] 
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Evidently the eigenvalues of L; are mh, where m (the appropriateness of this 
letter will also be clear in a moment) goes from —/ to +/ in N integer steps. In 
particular, it follows that / = —I + N, and hence / = N/2, so l must be an integer 
or a half-integer. The eigenfunctions are characterized by the numbers / and m: 


LF = P+ LOT = hm i, [4.118] 


where 
1—0, 1/2, 1, 3/2, ...; m-—-—l, -141,...,7-lL L [4.119] 


For a. given value of l, there-are 2/ + 1 different values of m (i.e, 2/ + 1 “rungs” 
on the “Jadder’’), 

Some people like to illustrate. this result with the diagram in Figure 4.9 
(drawn for the. case / — 2). The arrows are. supposed to represent possible angular 
momenta—in units of A they all have the same length //(/+ 1) (in this case 
4/6 = 2.45), and their z components are the allowed values of m (—2, —1, 0, 1, 
2). Notice that the. magnitude of the vectors (the radius of the sphere) is greater 
than the maximum z component! (In general, VI + 1) > 1, except for the “triv- 
ial" case | = 0.) Evidently you can’t get the angular momentum to point perfectly 
along the z direction. At first, this sounds absurd. “Why can't I just pick my axes 
so that z points along the direction of the angular momentum vector?" Well, to 
do this you would havé to know all three components simultaneously, and the 


FIGURE 4.9: Angular momentum states (for / = 2). 


a 


166 


Chapter 4 Quantum Mechanics in Three Dimensions 


uncértainty principle (Equation 4.100) says that's impossible. “Well, all right, but 
surely once in a while, by good fortune, I will just happen to aim my z-axis along 
the ditection of L.” No, no! You have missed the point. It’s not merely that you 
don't know all three components of L; there simply aren't three components—a 
particle just cannot have a determinate angular momentum vector, any more than it 
can simultaneously have a determinate position and momentum. If L, has a well- 
defined value, then Ly and Ly do not. It is misleading even to draw the vectors 
in Figure 4.9— at best they should be smeared out around the latitude lines, to 
indicate that Ly and. Ly are indeterminate. 

I hope you'te impressed: By purely algebraic means, starting with the fun- 
damental commutatiori relations for angular momentum (Equation 4.99), we have 
determined the eigenvalues of L? and L,—without ever seeing the eigenfunctions 
themselves! We turn now to the problem of constructing the eigenfunctions, but 
I should warn you that this is a much messier business. Just so you know where 
we're headed, I'll begin with the punch line: f = Y/" —the eigenfunctions of 
L? and L; are nothing but the old spherical harmonics, which we came upon by 

“a quite different route in Section 4.1.2 (that’s why I chose the letters / and m, 
of course). And I can now tell you why the spherical harmonics are orthogonal: 
They are eigenfunctions of hermitian operators (L? and L;) belonging to distinct 
eigenvalues (Theorem 2, Section 3.3.1). 


«Problem 4.18 The raising and lowering operators change the value of m by one 
unit: ba 
| Laf” = (ag T, [4.120] 
where A7" is some constant. Question: What is A”, if the eigenfunctions are to 
be normalized ? Hint: First show that Lẹ is the hermitian a es of L 4 (since 
L; and Ly ate observables, you may assume they are hermitian ... but prove it 


if you like); then use Equation 4.112. Answer: 


AP — hd 1) mi £ =A Ja zm m 4 n. [4.121] 


Note what happens at the top and bottom of the ladder (i.e., when you apply L+ 
to f or L— to F 


*Problem 4.19 


(a) Starting with the canonical commutation relations for position and: momentum. 


(Equation 4.10), work out the following commutators: 


[Lz, x] = ihy, [L;. y] = —ihx, [L;. z] = 0, 


1 2 4.122 
[D Pe pgs. Ie p semi prs 1h =o eae) 


wae toa. iS 


à i ida el 
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(b) Use these results to obtain [L;, Ly] = iALy directly from Equation 4.96. 


(c) Evalue the commutators [L;, r?] and [L;, p?] (where, of course, p= 
x? 4 y? +z and p? = p+ gap. 


(d) Show that the Hamiltonian H = (p?/2m) + V commutes with all three 
components of L, provided that V depends only on r. (Thus H, L? , and L; 
are mutually compatible observables.) 


x *Problem 4.20 


| (a) Prove that for a particle in a potential V(r) the rate of change of the expéc- 
| tation value of the orbital angular momentum L is equal to the expectation 
value of the torque: 


di (L) = N), 


where. 
N =r x (-VV). 
(This is the rotational analog to Ehrenfest’s theorem.) 


(b) Show that d(L)/dt = 0 for any spherically symmetric potential. (This is one 
form of the quantum statement of conservation of angular momentum.) 


4.3.2 Eigenfunctions 


ica of all we need to rewrite Ly, Ly, and L, in spherical coordinates. Now, 
= (Rh/i)(r x V), and the gradient, in spherical coordinates, is: 2A 


pues E : [4.123] 
r 


r 86 + °F sind 99 


meanwhile, r — r£, so 


h ð 
L= 7 [rexAs +6 x O55 +6 «Daz: 


? George. Axfken and Hans-Jurgen Weber, Mathematical Methods-for Physicists, 5th ed., Aca- 
demic. Press, Orlando (2000), Section 2:5. 


^ 
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But (F x P) = 0, (Ê x 6) = $, atid (f x 6) = —Ó (see Figure. 4.1), and hence 


à e a ^ J - 
d ġ— -90 L k : [4.124] 
i V 00 sind 29, 


The unit vectors Ü and $ can be resolved into their cartesian components: 


6 = (cos 6 cos $)f + (cos @ sin $) f — (sink; [4.125] 
$ = —Gin yf + (cos 9f. [4.126] 
Thus 
h P E) 
L = 5 [sing + cos @ 56 
(cos. cos $1 + cos0 sing 7 sind k) — ô 
— (cosó. cos S — ——— 
: 2 nd ag 
Evidently 
ee | ing à cos ocot 2 [4.127] 
Is-r|-s IE A 4. 
L El t ci on in @:coté a) [4.128] 
y= 08047 —5 — i 
P ab, U 
and 
h ð 
= —-—. 4.129 
c= 136 [ ] 


We: shall also need the raising and. lowering operators: 
Ly = Lx tiL, = —|(—sing ticos 9) — — (cos + i sing) coto — |. 
i 90 ` 0o 
But cos + i sing = et, so 
/ Q Q0. " 
pics dt^ +i cot | [4.130] 
In particular (Problem 4.21(a)): 


LL. —-RH Cee ee T ag [4.131] 
BT 30 ag? ' ap)’ 
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and hence (Problem 4.21(b)): 


LE 0 rod 1 09 
L 2 —h* | —— — | sing =; . 4.132 
Er 00 (sin z) t sin? 0 x [ ] 


We are now in a position to determine f?” (0, $). Its an eigenfunction of £?, 
with eigenvalue A?1 (I + 1): 


L? f^ = gg 1 9 T ə \ as 1 8 pm -——1 14 1) ££ 
Salt = sin. 30 os 80 sin? 0 06? mcer ue 


But this is precisely the “angular equation" (Equation 4.18). And it's also an eigen- 
function of Lz, with the eigenvalue mh: 


hô, 
Lh E" ree = mf, 
+È i: f 


but this is equivalent to the azimuthal equation (Equation 4.21). We have. already 
solved this system of equations: The result (appropriately normalized) is the spher- 
ical harmonic, Y/"(0, @). Conclusion: Spherical hatmonics are eigenfunctions. of 
L? and L,. When we solved the Schrödinger equation by separation of variables, 
in Section 4.1, we. were inadvertently constructing simultaneous. eigenfunctions of 
the three commuting operators H, L?, and L;: 


Hy = Ey, Ly = ld +), Lb =hmy. [4.133] 


Incidentally, we can use Equation 4.132 tọ rewrite the Schrödinger equation (Equa- 
tion 4.14) more compactly: 


1 9/50. 

There is a curious final twist to this story, for the algebraic theory of 
angular momentum permits / (and hence also m) to. take on Aalf-integer values 
(Equation 4.119), whereas separation of variables yielded eigenfunctions only for 
integer values (Equation 4.29). You might. suppose that the half-integer solutions 
are spurious, but it turns out that they are of profound importance, as we shall see 
in the following sections. 
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+Problem 4.21 
(a) Derive Equation 4.131 from Equation 4.130. Hint: Use a test function; other- 
wise you're likely to drop some terms. 


(b) Derive Equation 4.132 from Equations 4.129 and 4.131. Hint: Use Equa- 
tion 4.112. 


xProblem 4.22 
(a) What is LY}? (No calculation allowed!) 


(b) Use the result of (a), together with Equation 4.130 and the fact that. L; Y} = 
hl Y to determine Yl (0.0), up to a normalization constant. 


(c) Determine the normalization constant by direct integration. Compare your 
final answer to what you got in Problem 4.5. 


Problem 4.23 In Problem 4.3 you showed that 
¥3(0,@) = —v/15/87 sin cos 6e'?, 


I Apply the raising operator to find Y2(6, @). Use Equation 4.121 to get the normal- 
ization. 


Problem 4.24 Two particles of mass m are attached to the ends of a massless rigid 
rod of length a. The system is free to rotate in three. dimensions about the center. 
| (but the center point itself is fixed). 


(a) Show that the allowed energies of this rigid rotor are 


" _ ant 1) 


| nea A 
i ma? ` 


for n = 0, l, 2, vee 


Hint: First express the (classical) energy in terms of the total angular momen- 
tum. 

(b) What are the normalized eigenfunctions for this system? What is the degen- 
eracy of the nth energy level? 


—À 
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In classical mechanics, a.rigid object admits two kinds of angular momentum: 
orbital (L = r x p), associated with the motion of the center of mass, and spin 
(S. = I), associated with motion about the center of mass. For example, the earth 
has orbital angular momentum attributable to its annual revolution around the sun, 
and spin angular momentum coming from its daily rotation about the north-south 
axis. In the classical context this distinction is largely a matter of convenience, for 
when you come right down to it, S is nothing but the sum total of the “orbital” 
angular miomenta of all the rocks and dirt clods that go to make up the earth, as 
they circle around the axis. But an analogous thing happens in quantum mechanics, 
and here the distinction is absolutely furidamental. In addition to orbital angular 
momentum, associated (in the case of hydrogen) with the motion of the electron 
around the nucleus (and described by the spherical harmonics), the electron .also 
carries another form of angular momentum, which has nothing to do with motion 
in space (and which is not, therefore, described by any function of the position 
variables r, 0, 6) but which is somewhat analogous to classical spin (and for which, 
therefore, we. use the same word). It doesn't pay to press this analogy too far: The 
electron (as.far as we know) is a structureless point particle, and its spin angular 
momentum cannot be decomposed into orbital angular momenta of constituent 
parts (see Problem 4.25).?? Suffice it to say that elementary particles carry intrinsic 
angular momeritum (S) in addition to their “extrinsic” angular momentum (L). 
The algebraic theory of spin is a carbon copy of the theory of orbital angular 
momentum, beginning with the fundamental commutation relations:2° 


[Sx, Sy] =iAS,, [Sy,8,] =ihS,, [Se S] = ihS,. [4.134] 
It follows (as before) that the eigenvectors of s? and S; satisfy?" 


S^|sm) — B^s(s - 1)]sm);  $,|s m) — Am|s m); [4.135] 


ZRor a contrary interpretation, see Hans C. Ohanián, “What is Spin?" Am. J. Phys. 54, 500 
(1986). 


26 We shall take these äs postulates for the theory of spin; the analogous formulas for orbital angu- 
lar omentum (Equation 4.99) were derived. from the known. form of the operators (Equation 4.96). In 
a more sophisticated treatment they can both be obtained from rotational invariance-in three dimensions 
(see, for example; Leslie E. Ballentine, Quanfum Mechanics: A Modern Development, World Scientific, 
Singapore. (1998), Section 3.3). Indeed, these fundamental commutation relations apply to all forms of 
angular momentum, whether spin, orbital, or the combined angular momentum of a composite system, 
which could include some spin and some orbital. 

27 Because the eigenstates of spin are nol functions, 1 will use the “ket? notation for them. (I 
could have done thé same in Section 4.3, writing [I m) in place of Y7", but in that context the-function 
notation seems more natural.) By the way, I'm running out of letters, so I'll use m for the eigenvalue 
of Sz, just as I did for. Ly (some authors write mz and ms at this stage, just to be absolutely clear). 


x 
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and : 


S4|sm) = A sts D —m(n £1) |s (m E 1), [4.136] 


where S4 = S, t: i $,. But this time the eigenvectors are not spherical harmonics. 
(they're not functions of 6 and $ at all), and there is no a priori reason to exclude 
the: half-integer values of s and m: 


4. 3 
a. 5 1, rt .; m-—-—s,—$4d1,...,$—1,s. [4.137] 


It so happens that every elementary particle has a specific and immutable . 
valué of s, which we call the spin of that particular species: pi mesons have spin 
0; electrons have spin 1/2; photons have spin 1; deltas have spin 3/2; gravitons have 
spin 2; and so on. By contrast, the orbital angular momentum quantum number 


I (for an electron in a hydrogen atom, say) can take on any (integer) value you 
‘please, and will change from one to another when the system is perturbed. But s 


is fuh ot any given particle, and this makes the theory of spin comparatively 
simple.29 


Problem 4.25 1f the electron were a classical solid sphere, with radius 


2 
d cde. 4.138 
re areonc? Hs 


(the so-called classical electron radius, obtained by assuming the electron's mass is 
attributable to energy stored in its electric field, via the Einstein formula E — m c2), 
and.its angular momentum is (1/2)A, then how fast. (in m/s) would a point.on the 
“equator” be moving? Does this model make sense? (Actually, the radius of the. 
electron is known experimentally to be much less than ro, but this only makes 
matters worse.) 


28 Indeed, in a mathematical sense, spin 1/2 is the simplest possible nontrivial quantum system, for 


it admits just two basis states. In place of an infinite-dimensional Hilbert space, with all its subtleties 
and. complications, we find ourselves working in an ordinary 2-dimensional vector space; in place 


of unfamiliar differential equalions and fancy functions, we are confronted with 2 x 2 matrices and 
2-component vectors. For this reason, some: authors. begin- quahtum mechanics with the study of spin. 
(An dn example i is John S. Townsend, A Modem t to diverts Mechanics mer 


iret ‘not to do it ‘that way. 
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4.4.1 Spin 1/2 

By far the most.important case is s = 1/2, for this is the spin of the particles that 
make. up ordinary matter (protons, neutrons, and electrons), as well as all quarks 
and. all leptons. Moreover, once. you understand. spin 1/2, it is a simple matter to 
work out the formalism for any higher spin. There are just two: eigenstates: Iż 4), 
which we call. spin up (informally, +), and |i (- 1), which we call spin down 
(|). Using these as basis vectors, the general state of a spin-1/2 particle can be 
expressed as a two-element column matrix (or spinor): 


x= (5) = ax. t bx-, [4.139] 
with E 
X= (o) [4.140] 


rT (7) [4.141] 
for spin down. 


Meanwhile, the spin opérators become 2 x 2 matrices, which we. can work out. 
by noting their effect on x, and x_. Equation 4.135 says 


representing spin up, and 


" 3 | 3 
Sx = Mx. ad Sx = WX. [4.142] 


If we write S? as a matrix with (as yet) undetermined elements, 


then the first equation says 


| "um m 
c dV(1N 3,5 f1 c\ [—-Rh* 
C j (o) E (o): li C) uv 


so c= {3 OR and e = 0. The second équation, says. 


c 30-0. = 0-() 


so d = 0 and f = (3/4)h?. Conclusion: 


g.jp € 1) [4.143] 
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Similarly, : 
Sia. = 5X4,  Sy---zx. [4.144] 
2 2 
from which it follows that : 
| (1 0 
SES D x [4.145] 


Meanwhile, Equation 4.136 says 


Sip. = Fx, So xo Shy, S4x+=S-x- =0, 


sO a : 
m 0240 3E « 00 MW 
S, =} (o o) , S_= ( o) : [4.146] 
Now $4 = S, iS), so Sy = (1/2) (S4 + S_) and S, = (1/2i)(S4} — S_), and 
.. hence 5 f l h l 
A (0 1 . h(0 -i " 
S, = 2 « o) p SE t 0 ) [4.147] 


Since Sx, Sy, and S, all carry a factor of h/2, it is tidier to, write S = (4/2)o , 
where 


[4.148] 


These are the famous Pauli spin matrices. Notice that Sx, Sy, S;, and S? are. all 
hermitian (as they should be, since they represent observables). On the other hand, 
S4 and S... are not hermitian —evidently they are not observable. 

The eigenspinors of S; are (of course): 


X mm (o) : (cigenvalue + 5) ae A (1) : (<igenvatue — z) . [4.149] 


Tf you measure S; on a particle in the genéral state x (Equation 4.139), you could 


get +h/2, with probability |a|?, or —/2, with probability |b|”: Since these are the 


only. possibilities, 


la? +ib? — 1 [4.150] 


(ie., the spinor must be normalized)? 


?9 people often sày that laf? is the “probability that the particle is in the..spin-up state” but this 
is sloppy language; what they mean is that if you measured Sz, la? is the probability you'd get 4/2. 
See footnote 16 in Chapter 3. 
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But what if, instead, you chose to measure $x? What are the. possible results, 
and what are their respective probabilities? According to the generalized statisti- 
cal interpretation, we need to know. thé eigenvalues and eigenspinors of S,. The 
characteristic equation is 


2. 
sA A/2 ; fh j 
e o con 
A/2 Ae 02A =(5) TEES 


Not surprisingly; the possible values for S, are the same as those for S,. The 
eigenspinors are obtained in the usual way: 


ah o)(5)=*2 (5) > 7 * (9) 


so B = t o. Evidently the (normalized) eigenspinors of S, are 


(lo l 
Va K ae E h 
xP = 2 ; (cizenvatue + SF x = A  (cigenvatue 2) [4.151] 
Wa v2 


As the eigenvectors of a hermitian matrix, they span the space; the generic spinor. 
x (Equation 4.139) can be expressed as a linear combination of them: 


| a+b (x) (‘ =) (x) 4.152 
x= GE: $ B Ie [4.152] 


If you measure 5,, the probability of getting +h/2 is (1/2)|a + b|?, and the 
probability of getting —A/2 is (1/2)|a — b|^. (You should check for yourself that 
these probabilities add up to 1.) 


Example 4.2 Suppose a spin-1/2 particle is in the state 


x=33('7'). 


What are the probabilities of getting +//2 and —h/2, if you measure $; and 5,? 


Solution: Here a = (1-++i)/./6 and b = 2/4/6, so for S, the probability of getting 
+h/2 is |(1 + D/4/6P. = 1/3, and the probability of getting —h/2 is. |2/ V6)? = 
2/3. For Sy the probability of getting +A/2 is (1/2)|(3 + i)/v6 = 5/6, and 


* 
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the probability of getting —h/2 is*(1/2)|(—1 + )/V6|? = 1/6. Incidentally, the 


expectation value. of S, is. 
5 x el (RN OR 
6X 2)" 6. 27. 9 
which we. could also have obtained more directly: 


se ae CI) 


(Sx) = x Sy Xx = ( =z 


I'd like now to walk you through an imaginary measurement scenario involv- 
ing spin 1/2, because it serves to illustrate in very concrete terms some of the 
abstract ideas we. discussed back in Chapter 1. Let’s say we start out with a parti- 

_ cle in the state x+. If someone asks, “What is the z-component of that particle's 
spin angular momentum?", we.could answer unambiguously: +4/2. For a measure- 
ment of S. is certain to return that value. But if our interrogator asks instead, “What 
is the x-component of that particle's spin angular momentum?" we are obliged to. 
equivocate: If you measure S,, the chances ate fifty-fifty of getting either 7/2 or 
—h/2. If the questioner is a classical physicist, or a “realist” (in the sense of Section 
1.2), he will regard.this as an inadequate— not to say impertinent—response: “Are 
you telling me that you don’t know the true state of that particle?" On the contrary; 
I know precisely what the state of the particle is: x4. “Well, then, how come you 
can't tell me what the x-component of its spin is?" Because it simply does nof 
have a particular x-component of spin. Indeed, it cannot, for if both S, and $, 
were well-defined, the uncertainty principle would be violated. 

At this point our challenger grabs the test-tube and measures the x-component 
of its spin; let's say he gets the value 4-5/2. “Aha!” (he shouts in triumph), “You 
lied! This particle has a perfectly well-defined value of Sx; h/2." Well, sute—it 
does now, but that doesn’t prove it had that value, prior to your measurement. “You 
have.obviously been reduced to splitting hairs. And anyway, what happened to your 
uncertainty principle? I now. know both S, and $z.” I'm sorry, but you do not: In 
the course of your measurement, you- altered the particle’s state; it is now in the 
state xcu and whereas you know the value of Sy, you no longer know the value of 
Sz. "But I was. extremely careful not to disturb the particle when I measured S,.” 
Very well, if you don't believe mé, check it out: Measure S,, and see what you 
get. (Of course, he may get --h/2, which will be embarrassing to my case— but if 
we repeat this whole scenario over and over, half the time he will get —A/2.) 

To the layman, the philosopher, or the classical physicist, a statement of the 
formi “this particle doesn't have a well-defined position" (or momentum, or x- 
component of spin angular momentum, or whatever) sounds vague, incompetent, 
or (worst of all) profound. It is none of these. But its precise meaning is, I think, 
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almost impossiblé to convey to anyone who has not studied quantum mechanics 
in.some depth. If you find your own comprehension slipping, from time to time 
(if you don’t, you probably haven't understood. the problem), come back to the 
spin-1/2 system: It is the simplest and cleanest context for thinking through the 
conceptual paradoxes of quantum mechanics. 


Problem 4.26 


(a) Check that the. spin matrices (Equations 4.145 and 4.147) obey the funda- 
mental commutation. relations for angular momentum, Equation 4.134. 
(b) Show that the Pauli spin matrices (Equation 4.148) satisfy the product rule 
Ojok = Ojk +i M € jklO1, [4.153] 
d 


where the indices stand for x, y, or z, and eji is the Levi-Civita symbol: 
+1 if jkl = 123, 231, or 312; —1 if jkl = 132, 213, or 321; 0 otherwise. 


xProblem 4.27 An electron is in the spin state: 


gea) 


(a) Determine the normalization constant. A. 
(b) Find the expectation values of Sx, Sy, and S;. 


(c) Find the “uncertainties” o3,, os,, and os,. (Note: These sigmas are standard 
deviations, not Pauli matrices!) 


(d) Confirm that your results are consistent with all three uncertainty principles 
(Equation 4.100 and its cyclic permutations—only with S in place of L, of 
course). 


«Problem 4.28 For the most general normalized spinor x (Equation 4.139), com- 
pute (Sx), (Sy), (Sz), (SD. (S$), and (S7). Check that (82) + (Sj) + (S2) 
= (85, 


xProblem 4.29 
(a) Find the eigenvalues and eigenspinors of Sy. 


4 
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(b) If you measured Sy on a particle in the general state. x (Equation 4.139), 


what values might you get, and what is the probability of each? Check that | 
the probabilities add up to 1. Note: a and b need not be:real! | 
i 


(c) If you measured S2, what values might you. get, and with what probabilities? 


+ Problem 4.30 Construct the matrix S, representing the component of spin angular 
momentum along an arbitrary direction 7. Use spherical coordinates, for which 


f = sin coso î + sinb sing f+ cos k. [4.154] 
Find the eigenvalues and (normalized) eigenspinors of S,. Answer: 


O = ( £989/2) \. o. (e sino 2" | 
um Z won): T ( — cos(8/2) ) ned 


Note: You're always free to multiply by an arbitrary phase factor— say, ei? —so 
your answer. may not [ook exactly the same as mine. 


Problem 4.31 Construct the spin matrices (S,, Sy, and S;) for a particle of 
spin 1. Hint: How many eigenstates of S; are there? Determine the action of Sz, 
S+, and S. on each of these states. Follow the procedure used. in the text for 
spin 1/2. 


4.4.2 Electron in a Magnetic Field 


A spinning charged particle constitutes a. magnetic. dipole. Its magnetic dipole 
moment, u, is proportional to its spin angular momentum, S: 


p yS; [4.156] 


the proportionality constant, y, is called thé gyromagnetic ratio20 When a rhag- 
netic dipole is placed in a magnetic. field B, it experiences a torque, u x B, which 


30 See, for example, D. Griffiths, Introduction to Electrodynamics, 3rd ed. (Prentice Hall, Upper 
Saddle River, NJ, 1999), page 252. Classically, the gyromagnetic ratio of an object whose charge and 
mass are identically distributed is q/2m, where d is the charge and m is the mass. For reasons that are 
fully explained only in relativistic quantum theory, the gyromagnetic, ratio of the- electron is (almost) 
exactly twice the classical value: y — —e/m. 
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tends to line it up parallel to the field Gust like a compass needle). The energy 
associated with this torque isl 


H — y. B, [4.157] 
so the Hamiltonian of a spinning charged particle, at rest? in a magnetic field B, 


is 
H=—yB-S. [4.158] 


Example 4.3 Larmor precession: Imagine a particle of spin 1/2 at rest in a 
uniform magnetic field, which points in. the z-direction: 
B = Bok. [4.159] 


The Hamiltonian (Equation 4.158), in matrix forim, is 


| yBoh (1 0° 
H = -y BS, = -* > (à e [4.160] 


The eigenstates of H are the same as those of S;: 


| X+, with energy E, = —(y Boh)/2, [4.161] 


X—. with energy E— = +(y BoR)/2. 


Evidently the energy is lowest when the dipole moment is. parallel to the field — just 
as it would be classically. 
Since the Hamiltonian is time-independent, the general solution to the time- 
_ dependent Schródinger equation, 
ax 


ih — = Hy, 4.162] 
i E? X [ ] 


can be expressed in terms of the stationary states: 


(t) = Ey t/h +b ae es) _ ae? tate 
X AX+E X-€ Puro 


3! Griffiths (footnote 30), page 281. 

321f the particle is allowed to move, there will also be kinetic energy to consider; moteover, it 
will be subject to. the Lorentz force (gv x B), which is not.dcrivable:from a potential energy function, 
and hence does.not fit the Schrödinger equation as. we have formulated it so far. PIL show you later 
on how to handle this (Problem 4.59), but for the moment. I '§ just assume that the particle is free to 
rotate, bul otherwise stationary. 
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The constants a and b are determined by the initial conditions: 


PE 
x(0) = B 


(of course, |a|? + |b)? = 1). With no essential loss of generality?? PI write a = 
cos(a/2) and b = sin(a/2), where o is a fixed angle whose physical significance 
will appear in a moment. Thus 


^ LY Bot /2 
TM ( cos(o./2)ei Bot/ ) [4.163] 


sin(g/2)e Y Bui? 


To get a feel for what is happening here, let's calculate the: expectation value of S, 
as. a function of time: 


(Se) = x (0S.x (t) = (cos(a/2)e 7" P? — sin(g/2)elY Bot?) 


h (0 1Y f cos(a/2)elY Bot? | 
E B o) Ge /2)e iY Bot /2 [4.164] 


= » sin cos(y Bot). 


Similarly, 
(Sy) = x(0'Syx() = -7 sin à sin(y Bor), [4.165] 
and h 
(Sc) = X (D Sex 0) = 5 cose. | [4.166] 


Evidently (S) is tilted at a constant.angle o to the z-axis, and precesses about the 
field. at the Larmor frequency 
w = y Bo, [4.167] 


just as it would classically** (see Figure 4.10). No surprise here —Ehrenfest's 
theorem (in the form derived in Problem 4.20) guarantees that (S) evolves accord- 
ing to the classical laws. But it’s nice to see how this works out in a specific 
context. 


33This does assume that a and. b are real; you can work out the. general case if you like, but all 
iL.does is add a constant to 7. 

34 See. for instance, The Feynman Lectures on Physics (Addison-Wesley, Reading, 1964), Volume: 
IL Section 34-3. Of course, in thc classical case it is the angular momentum vector itself, not just its 
expectation value, that precesses around the magnetic. field. 
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xY 


FIGURE 4.10: Précession of (S) in a 
uniform magnetic field. 


Example 4.4 The Stern-Gerlach experiment: In an inhomogeneous magnetic 
field, there is not only a torque, but also a force, on a magnetic dipole:* 


F=V(u-B). [4.168] 


This force can be used to separate out particles with a particular spin orienta- 
tion, as follows. Imagine a beam of relatively heavy neutral atoms," traveling in 
the y direction, which passes through a région of inhomogeneous magnetic field 
(Figure 4.11)— say, 


B(x, y, z) = —axf + (By + az), [4.169] 
where. Bo is à strong uniform field and the constant œ describes a small deviation 
from homogeneity. (Actually, what we'd like is just the z component, but unfortu- 
nately that's impossible— it would violate thé electrómagnetic law V . B = 0; like 


it or not, an x component comes along for the ride.) The force on thesé atoms is 


F = yaC- f + Sk). 


S Griffiths. (footnote 30), page 258. Note that F is the negative gradient of the energy 
(Equation 4.157). j 

36We make thém neutral so às to avoid the large-scale deflection that. would otherwise result 
from the Lorentz force, and heavy so we.can construct: localized wave packets and treat the. miotion 
in terms of classical particle trajectories, In practice, the. Stern-Gerlach experiment doesn't work, for 
example, with a beam of free electrons. 


a 
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Spin up 


‘Spin down 


Magnet 


FIGURE 4.11: The Stern-Gerlach apparatus. 


But because. of the Larmor precession about Bo, $+ oscillates rapidly, and 
averages to zero; the net force is in the z direction: 


Fz = yasz, [4.170] 


‘and the beam is deflected up or down, in proportion to the. z component of the 


spin angular momentum. Classically we'd expect a smear (because. $; would not. 
be quantized), but in fact the beam splits into 2s + 1 separate streams, beau- 
tifully demonstrating the quantization. of angular momentum. (If you usé silver 
atoms, for example, all the inner electrons: are paired, in such a way that their 
spin and orbital. angular momenta cancel. The. net spin is simply that of the: 
outermost—unpaired—electron, so in this case.s = 1/2, and the beam splits 
in two.) 

Now, that argument, was purely classical, up to the very final step; “force”. 
has no place in a proper quantum calculation, and you might therefore: prefer 
the following approach to the same problem.?" We examine the process from the 
perspective of a reference frame that moves along with the beam. In this frame the 
Hamiltonian starts out zero, turns on for a time T (as the particle passes through 
the magnet), and then turns off again: 


0, for £ <.0, 
A(t) 24—y(Bo--az)S, forO0<t<T, [4.171] 
0, for t > T. 


(I ignore the pesky x component of B, which— for reasons indicated above—is 
irrelevant to the problem.) Suppose the atom has spin 1/2, and starts out in the 
state 

X(t) =axyy+bx-, fort <0. 


37 This argument follows L. Ballentine (footnote 26) Section 9.1, 
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While the Hamiltonian acts, x(t) evolves in the usual way: 
X(t) = axe IF + by e IE for «t « T, 


where (from Equation 4.158) 
E+ = ry (Bo t az) >; [4.172] 


and hence it emerges in the state 


Ke) = (ac T/y, ) BOYT 4 (pu-ivrme, Ja evt, [4.173] 
(for t > T) The two terms now carry momentum in the z direction (see Equa- 
q 
tion 3.32); the spin-up component has momentum 
, ayTh 


= ; 4.174 
Pz 7 [4.174] 


and it moves in the plus-z direction; the spin-down component has the oppo- 
site momentum, and it moves. in the minus-z direction. Thus the beam splits 
in two, as before. (Note that Equation 4.174 is consistent with the earlier result 
(Equation 4.170), for in this case S, = 5/2, and p; = FT.) 

The Stern-Gerlach experiment has played an important role in the philosophy 
of quantum méchanics, where it serves both as the prototype. for the preparation 
of a quantum state and as an illuminating model for a certain kind of quantum 
measurement. We tend casually to assume that the initial state of a system is known. 
(the Schrédinger equation tells us how it subsequently evolvés)— but it is natural 
to wonder how you get a system into a particular state in the first place. Well, if 
you want to prepare a beam of atoms in a given spin configuration, you pass an 
unpolarized beam through a Stern-Gerlach magnet, and select the outgoing stream 
you are interested in (closing off the others with suitable baffles and. shutters). 
Conversely, if you want to measure the z component of an atom's spin, you send 
it through a Stern-Gerlach apparatus, and record which bin it lands in. I do not 
claim that this is always the most practical way to do the job, but it is conceptually 
very clean, and hence a useful context in which to explore the problems of state 
preparation arid measurement. | 


Problem 4.32 In Example 4.3: 


(a) If you measured the component of spin angular momentum along the x 
direction, at time £, what is the probability that you would get +#/2? 
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(b) Same question, but for the y»component. 


(c) Same, for the z component. 


x *Problem 4.33 An electron is at test in an oscillating magnetic field 
B= Bo cos(wt)k, 


where Bo and w. are constants. 


(a) Construct the Hamiltonian matrix for this. system. 


(b) The electron starts out (at t = 0) in the spin-up state with respect to the 
x-axis. (that is: x (0) = xv. Determine x (£) at any subsequent time. Beware: 
This is a time- EDEN Hamiltonian, so you cannot get x(£) in the usual 
way from stationary states. Fortunately, in this case you can solve the time- 
dependent Schródinger equation (Equation 4.162) directly. 


(c) Find the probability of getting —h/2, if you. measure Sy. Answer: 


siní (Em sin(@ D). 
20 


(d) What is the minimum field (Bo) required to force a complete flip in Sy? 


4.4.3 Addition of Angular Momenta 
Suppose now that we have two spin-1/2 particles—for example, the electron and 


the proton in thé ground state?? of hydrogen. Each can have spin up or spin down, 
so there are four possibilities in all:?? 


tts the YT. [4.175] 


where the first arrow refers to the electron and the second to the proton. Question: 
What is the total angular momentum of the atom? Let 


§=5) 4 s2, [4.176] 


38] put them in the ground state so there won't be any orbital angular mofncritüm to worry about. 


3° More precisely, each particle is in a linear combination of spin up and spin. down, and. the 
composite system is in a linear combination of the four states. listed. 
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Each of these four composite states is an eigenstate of $,—the z components 
simply add: 
Sexix2 = (SP + Sy x3 90 = (SP XV) jn (SO x0) 
= (him Xx) X2 + x1 ima x2) = Amy + m2)X1 X2, 

(note that S® acts only on x1, and S® acts only on x2; this notation may not be 
elegant, but it does the job). So m (the quantum number for the composite system) 
is just my + ma: 

Tt: m= 

Tl: m — 0; 

LE m= 0; 

Li: m=-1. 

At first glance; this doesn’t look right: m is supposed to advance in integer 

steps, from —s to +s, so it appears that s = 1—but there is an “extra” state 


with m = 0. One way to untangle this problem is to apply the lowering operator, 
S- = SC) +5 to the state ++, using Equation 4.146: 


S-41 = (GU 444 4 (S 4 
=AL Tr-TOJD-hQT- TD. 


Evidently the three states with s = 1 are (in the notation |s m)): 


== 
|IED- 44 

l0 —J5M +d) p s= 1 (triplet). [4.177] 
Ln- = 4 


(As a check, try applying the lowering operator to |10); what should you get? See 
Problem 4.34(a).) This is called the triplet combination, for the obvious reason. 
Meanwhile, the orthogonal state with m = 0 carries s = 0: 


[ioo = 5c — 40]. s = 0 Gingen. [4.178] 


(If you apply the raising or lowering operator to this state, you’ll get zero. See 
Problem 4.34(b).) 
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I claim, then, that the combination of two spin-1/2 particles can carry a total 
spin of 1 or 0, depending on whether they occupy the triplet or the singlet config- 
uration. To confirm this, I need to-prove. that the triplet states are eigenvectors of 
8? with eigenvalue 28, and the singlet is an eigenvector of S? with eigenvalue 
0. Now, 


8? = (SM +8). GO 4 $0) = (SM)? + (9) 428M .sO. — [4179] 
Using Equations 4.145 and 4.147, we have 


SD. SAY) = 6M 4S? 1). 4 SP HG YHEL Hs? D 


“ENGNG GNGY) 


B : | 
| 204-90. 
Similarly, 


| n 
SYS) = TON- 
It. follows that 


HP] 


gO .sO 10 
|1 0) 4 Si 


2 
QU-N42t-W=F10, 4.180) 


and 


sP $100) =F —E@It-N2N+ =F 100), 4.81) 


Returning to Equation 4.179 (and using Equation 4.142), we conclude that. 


i (38 amb A? | 
S*|10) = ( a + 4 7) |10) = 2h7|10), [4.182] 


so |10) is indeed an eigenstate of S? with eigenvalue 2h; and 


/3n? 3A aA?) RE 
$2100) = E i ; -277 100) = 0. [4.183] 


so |00) is an eigenstate of S? with eigenvalue 0. (I will leave it for you to confirm 
that |] 1) and |1 —1) are eigenstates of S?, with the appropriate eigenvalue — see 
Problem 4.34(c).) 
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What we have just.done (combining spin 1/2 with spin 1/2 to get spin 1 and 
spin 0) is the simplest example of a larger problem: If you combine spin s; with 
spin s2, what total spins s can you get?^9 The answer*! is that you get every spin 
from (s; + s2) down to (s1 — 52)—or (sz — sz), if sz > s —in integer steps: 


s = (s1 +52), (14-52 — D, (19-855 —2), .... |i — 32|: [4.184] 


(Roughly speaking, the highest total spin occurs when the individual spits are 
aligned parallel to one another, and the lowest occurs when they are antiparallel.) 
For example, if you package together a particle of spin 3/2 with a particle of spin 
2, you could get a total spin of 7/2, 5/2, 3/2, or 1/2, dépending on the configuration. 
Another example: If a hydrogen atom is in the state Wyrm, the net angular momen- 
tum of the electron (spin plus orbital) is Z + 1/2 or / — 1/2; if you now throw in 
spin of the proton, the atom’s total angular momentum quantum number is / + 1, 
l, or | — 1 (and / can be achieved in two distinct ways, depending on whether the 
electron alone is in the / + 1/2 configuration or the / — 1/2. configuration). 

The combined state. |s m) with total spin s and z-component m will be some 
linear combination of the. composite states |s1 71) |s2 ma): 


m= Y Chi mlsi mi)ls2m2) [4.185] 


m--m;—m 


(because the. z components add, the only composite states that contribute. are those 
for which m; + m2 = im). Equations 4.177 and 4,178 are special cases of this 
general form, with s, = sp = 1/2 (I used the informal notation + = la +), = 
b —1))). The constants Cj//5,,, are called Clebsch-Gordan coefficients. A few 
of the simplest cases are listed in Table 4.8.? For example, the shaded column of 
the 2 x.1 table tells us that 


30 = 4211 1) + 432010) + 32 1)|11). 
In particular, if two particles (of spin 2 and spin 1) are at rest in a box, and the total 


spin is 3, and its z component is 0, then a measurement of gid could return the 
value A (with probability 1/5), or 0 (with probability 3/5), or —A (with probability 


401 Say spins, for simplicity, but either one: (or both) could just as well be orbital angular 
momentum (for which, however, we would use the letter 1). , 


4l For a proof you must look in à more advanced text; see, for instance, Claude Cohen-Tannoudji, 
Bernard Diu, and Franck Laloé, Quantum Mechanics, (Wiley, New York, 1977), Vol. 2, Chapter X. 


The general formula is.derived in Amo Bohm, Quantum Mechanics: Foundations and Appli- 
cations, 2nd. ed., (Springer, 1986), p. 172. 


A 
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TABLE 4.8: Clebsch-Gordan coefficients. (A square root sign i$ uridérstood for 
every entry; the minus sign, if present, goes outside the radical.) 


We x vaj- : 
Esp. l 2» 1/2 sekr 


aye ue if T 
-12 4/2 |12 -i/2 4 
-142 —1/2 
1% 1/2 
= 3/2 1/2 
(M2 +142 
A 2| vs cx sx» i2 
9 2| 28-13-12 -12 
0-12] 2. wall 32 
-1 41/2} 13 -2/3[-372 


42-2 /15 4/5] 5i2 y2 
+E +42 [45 1/50 1/2 «12 


4l -1/2-] 25 35 
0 +4/2 


/5bl 5m 32. 
3/5 -5|-1/2 +172] 


0 -12| a5 gsf 52 ae 
-1 +42 2/5 -452 -y2 
- 12 48 Tl 5/2 
-2 (n/2| 1/5 -4/5]-52 
2|s4 -ij D 

«12. -t/2|32. 12] 2 1 
-12 +122 |12 -12|]-1 -1 
-12 -1/2 (3/4 tif 2 
~3/2 v1/2|1/4 -3/4|-2 


15/2) BP 3/2 
132 43/2 
«x2 0 [2S gsf 52. 3/2 
+2 +4 | 3/5 2s [e He +1/2 
isi ims faex 52 3S2? V2 | 
Loe -12 -41/2 


T -1 35 2/5[-52 
-8 0| 2/5 -3/5]--5/2 


real 


1x1 


E E. 
Er 
+ Oe E 

0.d [i2 -t[ 0 
+1 -1 [1/6 is T 
0 0/23 -sfz 1 
141416 72 13494 i 
0-1|wu2 iof 2 
=4 Ofte -1/28 -2 


il] 


-1 -1)28 14. 
2 0) 1/3 -2/3 3 


1/5). Notice that the probabilities add up to.1 (the sum of the squares of any column 
on the Clebsch-Gordan table is 1). 
These tables also work the other way around: 


si m)1s2 ma) = Y C222, ls m). [4.186] 


y 


For example, the shaded row in the 3/2 x I table tells us that 


ibuo-/35D-/5üb-/Iup. 


If you put particles of spin 3/2 and spin 1 in the box, and you know that the 
first has mı = 1/2 and the second has mz = 0 (so m is necessarily 1/2), and. 
you measure the total spin, s, you could get 5/2 (with probability 3/5), or 3/2 
(with, probability 1/15), or 1/2 (with probability 1/3). Again, the. sum of the prob- 
abilities is 1 (the sum of the squares of each row on the Clebsch-Gordan table 
is 1). 

If you think this is starting to sound like mystical numerology, I don't blame 
you. We will not be using the Clebsch-Gordan tables. much in the rest of the 
book; but I wanted you to know where.they fit into the scheme of things, in case 
you encounter them later on. In a mathematical sense this is all applied group 
theory —what we are talking about is thé decomposition of the direct product of 


Section 4.4: Spin 189 


two irreducible representations of the rotation group into a direct sum of irreducible 
representations (you can quote that, to impress your friends). 


xProblem 4.34 


(a) Apply S_ to |10) (Equation 4.177), and confirm that you get /2h|1—1). 
(b) Apply S+ to |00) (Equation 4.178), and confirm that you get zero. 


(c) Show that |1 1) and |1 —1) (Equation 4.177) are eigenstates of S°, with the. 
appropriate eigenvalue. 


Problem 4.35 Quarks carry spin 1/2. Three quarks bind together to make a baryon 
(such as the proton or neutron); two quarks (or more precisely a quark and an 
antiquark) bind together to make a meson (such as the pion or the kaon). Assume 
the quarks are in the ground state. (so the orbital angular momentum is zero). 


(a) What spins are possible for baryons? 


(b) What spins are possible for mesons? 


Problem 4.36 


(a) A particle of spin 1 and a particle of spin 2 dre at rest in a configuration 
such that the total spin is 3, and its z component is 7. Jf you measured the 
z component of thé angular momentum of the spin-2 particle, what values 
might you get, and what is the probability of each one? © 


(b 


— 


An electron with spin down is in the state W519 of the hydrogen atom. If you 
could measure the total angular momentum squared of the. electron alone 
(not including the proton spin), what values might you get, and what is: the 
probability of each? 


Problem 4.37 Determine the commutator of S? with sf (where S = S + SQ), 
Generalize your result to. show that 


Comment: Because s does not commute with S?, we cannot hope to find states 
that are simultaneous eigenvectors of both. In order to form eigenstates of S? we 


a 
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need linear combinations of eigenstates of s» Thís is precisely what the Clebsch- 
Gordan coefficients (in Equation 4.185) do for us, On the other hand, it follows by 
obvious inference from Equation 4.187 that the sum S(? -- S? does commute with 
S?, which is a special case of something we already knew (see Equation. 4.103). 


FURTHER PROBLEMS FOR CHAPTER 4 


xProblem 4.38 Consider the three-dimensional harmonic oscillator, for which 
the. potential is. 
2 


a, Al | | 
Vir) = jme r2, [4.188] 


(a) Show that separation of variables in cartesian coordinates turns this into 
three one-dimensional oscillators, and exploit your knowledge of the latter 
to determine the allowed energies. Answer: 


Ej = (à + 3/2)ho. [4.189] 


(b) Determine the degeneracy d(n) of Ep. 


x x Problem 4.39 Because the three-dimensional harmonic oscillator potential (Equa- 
tion 4.188) is spherically symmetric, the. Schrödinger equation can be handled by 
separation of variables in spherical coordinates, as well as cartesian coordinates. 
Use the power series method to solve.the radial equation. Find the recursion formula 
for the coefficients, and determine the allowed energies. Check your answer against 
Equation 4.189. 


xProblem 4.40 
(a) Prove tlie tliree-dimensional virial theorem: 
XT) = (r- VV) [4.190] 


(for stationary states). Hint: Refer to Problem 3.31. 


(b) Apply the virial theorem to the case of hydrogen, and show that 


(T)=—En; (V) =2En. [4.191] 
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(c) Apply the virial theorem to the three-dimensional harmonic oscillator (Prob- 
lem 4.38), and show that in this case 


(T) = (V) = E,/2. [4.192] 


x xProblem 4.41 [Attempt this problem only if you are. familiar with vector cal- 
culus.] Define the (three-dimensional) probability current by generalization of 
Problem 1.14: " 

Jm; (wvw* wt vv). [4.193] 
2m 


(a) Show that J satisfies the continuity equation 


EE 
V. [4.194] 


which expresses local conservation of probability. It follows (from the 
divergence theorem) that 


| ass Mw? dr, [4.195] 
S dt Jy 


where Y is a (fixed) volume and S is its boundary surface. In words: The flow 
of probability out through the surface is equal to the decrease in probability 
of finding the particle in the volume. 


= 


Find J for hydrogen in the state n= 2, | = 1, m = 1. Answer: 


A 


64x ma? re "/4 sind $. 


— 


(c) If we interpret mJ as the flow of mass, the arigular momentum is 


Lom [exnér. 


Use this to calculate Lz for the state W211, and comment-on the result. 


x x Problem 4.42 The (time independent) momentum space wave function in three 
dimensions is defined by the natural generalization of Equation 3.54: 


$p) = J e PP (y) d'r. [4.196] 


1 
Qnxh)P 
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(a) Find the momentum space wave function for the gtound state of hydrogen 
(Equation 4.80). Hint: Use spherical coordinates, setting the polar axis along 
the. direction of p. Do the 0 integral first. Answer: 

op) =~ e "are | [497 
Paa) Fap kir 

(b) Check that 9 (p) is normalized. 

(c) Use @(p) to calculate (p?), in the ground state of hydrogen. 

(d) What is the expectation value of the kinetic energy in this state? Express 
your answer as a multiple of E1, and check that it is. consistent with. the 
virial theorem (Equation 4.191). 

Problem 4.43 

| (a) Construct the spatial wave function (V) for hydrogen in the state m = 3, 
| = 2, m = 1. Express your answer as a function of r, 0, $, and à (the 
Bohr radius) only —no other variables (p, z, etc.) or functions (Y, v, etc.), or 
constants (A, co, etc.), or derivatives, allowed (zt is okay, and e, and 2, etc.). 

(b) Check that this wave function is properly normalized, by carrying out the: 
appropriate integrals over r, 0, and @. 

(c) Find the expectation value of r* in this state. For what range of s (positive 
and negative). is the result finite? 

Problem 4.44 


(a) Construct the wave function for hydrogen in the state n = 4; 1 = 3, m = 3. 
Express your answer as a function of the spherical coordinates r, 0, and $. 

(b) Find the expectation value of r in this state. (As always, look up any nontrivial 
integrals.) 

(c) If you could somehow measure the observable. L2 + L? on an atom in this 
state, what value (or values) could you get, and what is the probability of 
each? 


Problem 4.45 What is the probability that an electron in the ground state of hydro- 
gen will be found inside the nucleus? 


(a) First calculate the exact answer, assuming the wave. function (Equation 4.80) 
is correct all the way down to. r = 0. Let b be the radius of the nucleus. 


Toya aig Aas ae ig e 
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(b) Expand your result as a power series in the small number e = 2b/a, and 
show that the. lowest-order term-is the cubic: P ^: (4/3)(b/ a^. This should 
be a suitable approximation, provided that b «& a (which it is). 

(c) Alternatively, we might assume that v/(r) is essentially constant over the 
(tiny) volume of the nucleus; so that P ^: (4/3) b? |y (0)|?. Check. that you 
get the same answer this way. 

(d) Useb = 10-5 m and à 7 0.5 x 10719 m to get a numerical estimate for P. 
Roughly speaking, this represents the "fraction of its time that the electron 
‘spends inside the nucleus.” 


Problem 4.46 


(a) Use the recursion formula (Equation 4.76) to confirm that when / = n — 1 
the radial wave function takes the form 
n—i e qu 


Rag@—1) = Nar 


and determine the normalization constant Np by direct integration. 


E 


Calculate (r) and (r?) for states of the form. Vntn—lyn- 

Show that the “uncertainty” in r (o,) is (r)/4/2n +1 for such states. Note 
that the fractional spread in r decreases, with increasing n. (in this sense the 
system “begins to. look classical,” with identifiable circular “orbits,” for large 
n). Sketch the radial wave functions for several values of n, to illustrate this 
point. 


e 
~x 


Problem 4.47 Coincident spectral lines.*? According to the Rydberg formula 
(Equation 4.93) the. wavelength of a line in the hydrogen spectrum is determined 
by the principal quantum numbers of the initial and final states. Find two distinct 
pairs {n;, ^ f} that yield the same A. For example, (6851, 6409} and (15283, 11687} 
will do it, but you’re not allowed to use those! 


Problem 4.48 Consider the observables A = x? and B = L;. 
(a) Construct the uncertainty principle for c4og. 
(b) Evaluate og in the hydrogen state. Vj. 


(c) What can you conclude about (xy) in this state? 


43.Nicholas. Wheeler, “Coincident Spectral Lines” (unpublished Reed College report, 2001). 


a 
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Problem 4.49 An electron is in the spin state 


1—2i 
ei) 


(a) Determine the constant A by normalizing x. 

(b) If you measured S, on this electron, what. values could you get, and what is 
the probability of each? What is the expectation value of S,? 

(c) If you measured Sy on this: electron,. what values could you get, and what is 
the probability of each? What is the expectation value of $,? 


(d) If you measured Sy on this electron, what values could you get, and what is 
the probability of each? What is the expectation value of $,? 


* x *Problem 4.50 Suppose two spin-1/2 particles are known to be in the singlet config- 
uration (Equation 4.178). Let Sà be the component of the. spin angular momentum 
of particle number 1 in the direction defined by the unit vector à. Similarly, let 
se ? be the component of 2's angular momentum in the direction b. Show that 


h? | 
(Ss = ESL [4.198] 
where. 0 is the angle between @ and b. 


* x xProblem 4.51 


(a) Work out the Clebsch-Gordan coefficients for the case sı = 1/2, s2 = any- 
thing. Hint; You're looking for the coefficients A and B in 


ism) = Al} Dia (m — 2) + BIZ C- 3) 0n + 3). 


such that |s m} is an eigenstate of S2. Use the method of Equations 4.179 
through 4.182. If you can't figure out what S; (2) (for instance) does to |s2 m2), 
refer back to Equation 4.136 and the line before Equation 4.147. Answer: 


sy Him + 1/2 
25, +1 


527p m -- 1/2 


: JH - 
282 +1 


where the signs are determined by s = s; + 1/2. 


(b} Check this general result against three or four entries in Table 4.8. 
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Problem 4.52 Find the matrix representing Sx for a particle of spin 3/2 (using, as 
always, the basis of eigenstates of S). Solve the characteristic equation to determine 
the eigenvalues of Sy. 


x x Problem 4.53 Work out the spin matrices for arbitrary spin s, generalizing spin 1/2 
(Equations 4.145 and 4.147), spin 1 (Problem 4.31), and spin 3/2 (Problem 4.52), 


Answer: 
$ 0 0 0 
0 s—1 0 0 
S. =} 0 0 s—2 0 
0 0 0 —s 
0 bs 0 0 0 0 
Bs 0 by 0 0 0 
O bi 0 b 0 0 
s,-^|0 0 ba 0 0 0 
2 : » * 
0 0 0 0 0 — Poggi 
0 0 0 0 Ber. 0 
0 —ib; 0 0 0 0 
ib, 0 —ib,.| 0 bo 0 0 
0 ib, 0 —ib, 2 -> 0 0 
sat] 0 0 ik; 0 . Ô 0 
” 2 : 3 : : 
0 0 0 Oo er s CUL iB 
lo 0 0 Os xt HEC. O 


where 


bj = sd Gc1- J). 


x x «Problem 4.54 Work out the normalization. factor for the spherical harmonics, as 
follows. From Section 4.1.2 we know that 


Yr = Bie"? PI" cos By 


the problem is to determine the factor B” (which I quoted, but did not derive, 
in Equation 4.32). Use Equations 4.120, 4.121, and 4.130 to obtain a recursion 
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relation giving pr in terms of By”. Solve it by induction on m to get By” up to 
an overall MEE EÒ. Finally, use the result of Problem 4.22 to fix the EA 
You may find the following formula for the derivative of an associated Legendre 
function useful: 


2 PPP E 
ks P ss =VJV1—x2 PP! — mx P”. [4.199] 


Problem 4.55 The electron in a hydrogen atom occupies the combined spin and 
position state. 


(a) 


Rz (VIB X+ + Vaf Yi x-)- 


If you measured the orbital angular momentum squared (L?), what values 
might you get, and what is the probability of each? 


| Same for the z component of orbital angular momentum (L3). 


Same for the spin angular momentum squared. ( $2), 
Same for the z component of spin angular momentum (S,). 
Let J z L 4- S be the total angular momentum. 


If you measured J?, what values might you get, and what is the probability 
of each? 


Same for J.. 

If you measured the position of the particle, what is the probability density 
for finding it at r, 0, 6? 

If you measured both the. z component of the spin and the distance from the 
origin (note that these are compatible observables), what is the probability 


density for finding the particle with spin up and at radius 7? 


x x Problem 4.56 


(a) 


For a function f($) that can be expanded in a Taylor series, show that 
f+) =e!" fg) 


(where g is an arbitrary angle). For this reason, L;/h is called the. gener- 


ator of rotations about the z-axis. Hint: Use Equation 4.129, and refer to 


Problem 3.39. 


Further Problems for Chapter 4 197 


More generally, L -Z/h is the generator of rotations about the direction 7, in 
the sense that exp L- ig/f) effects a rotation through angle g (in the right- 
hand sense) about the axis ñ. In the case of spin, the generator of rotations 
is S- &/h. In particular, for spin 1/2 


x! — dame 2 X [4.200] 


tells us how spinors rotate. 


(b) Construct the (2 x 2) matrix representing rotation by 180° about. the x-axis, 
and show that it converts “spin up” (x+) into “spin down" (x_), as you 
would expect. 

(c) Construct the matrix representing rotation by 90? about the y-axis, and check 
what it does to x4. 


(d) Construct the matrix representing rotation by 360° about thé z-axis. If the 
answer is not quite what you expected, discuss its implications. 


(€) Show that n 
eM? — cos(o/2) +i - a) sin(g/2). [4.201] 


x Problem 4.57 The fundamental commutation relations for angular momentum 
(Equation 4.99) allow for half-integer (as well as integer) eigenvalues. But for 
orbital angular momentum only the integer values occur. There must be some 
extra constraint in the specific form L = r x p that excludes half-integer values. ^ 
Let a be some convenient constant with the dimensions of length (the Bohr radius, 
say, if we're talking about hydrogen), and define the operators. 


qi A E + (a? /h) py | > p= 4 [px — (h/a*)y]; 


n= |- 0p]; p= J [Pe + ay]. 

(a) Verify that [9,92] = [p1, P2] = 0; [q1, pil = [q2, p2] = ih. Thus the 
q’s and the p's satisfy the canonical commutation relations for position and 
momentum, and those of index 1 are compatible with those of index 2. 

(b) Show that 


i, = Fg ah + S t - pb 
EE x D LE 2 


44This problem is based on an argument in Ballentine (footnote. 26), page 127. 
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(c) Check that L, = Hj — H2, where each H is the Hamiltonian for a harmonic 
oscillator with mass m = h/a? and frequency w = 1. 


(d) We know that the eigenvalues of the harmonic. oscillator Hamiltonian are. 
(n+ 1/2), where n = 0, 1, 2, ... (In the algebraic theory of Section 2.3.1 
this follows from the form of the Hamiltonian and thé canonical commutation 
relations). Use this to conclude that the eigenvalues of L; must be integers. 


Problem 4.58. Deduce the condition for minimum uncertainty in S; and Sy (that 
is, equality in the expression os,cs, > (A/2)|(S;)|), for a particle of spin 1/2 in 
the generic state (Equation 4.139). Answer: With no loss of generality we can pick 
a to be real; then the condition for minimum. uncertainty is that b. is either pure 
real or else pure imagiiiary. 


x x «Problem 4.59 In classical electrodynamics the. force on a particle of charge q 
moving with velocity v through electric and magnetic fields E and B is given by 
the Lorentz force law: 

F =q Œ +y x B). [4.202] 


This force. cannot be expressed as the. gradient of a scalar potential energy function, 
and therefore the Schrédinger equation in its original form (Equation 1.1) cannot 
accommodate it. But in the more sophisticated form 


aw PERR 
ih = HW [4.203] 


there is no problem; the classical Hamiltonian? is. 


1 . 
H = — (p - 4A? +49, [4.204] 
2m 
where A is the vector potential (B = V x A) and o is the scalar potential (E = 


—Vq — 0A/8t), so the Schródinger equation (making the canonical ‘substitution 
p — (f/i)V) becomes | 


aw 1 fh 2 " 
ih -| ! (v aA) ee] Vy. [4.205] 


‘See, for example, H. Goldstein, C. P. Poole, and J. L. Safko, Classical Mechanics, 3rd ed;, 
(Prentice Hall, Upper Saddle River, NJ, 2002), page 342. 
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(a) Show that : i 
t m 


(b) As always (see Equation 1.32) we identify d(r)/dt with (v). Show that 


2 
n = gh) + px B-B xp) EA xB) [42071 
5 2m. m 


(c) In particular, if the fields E and B are uniform over the volume of the wave 


packet, show that 


mo) = (B+ (v) xB), [4.208] 


so the expectation value of (v) moves according to the Lorentz force law, as 
we would expect from Ehrenfest’s theorem. 


x x Problem 4.60 [Refer to. Problem 4.59 for background.] Suppose 


Bo, . ^ . 
A= 5 i — yi) and g= Kz’, 


where Bo and K aré constants. 
(a) Find the fields E and B. 
(b) Find the allowed energies, for a particle of mass m and charge q, in these 
fields, Answer: 


E(ni, n2) = (ny + Dh +(n2+ Dhor, (11,722 —0,1,2,...), [4.209] 


where œw; = qBo/m and œ = 2g K/m. Comment: If K = 0 this is the 
quantum analog to cyclotron motion; œ; is the classical cyclotron frequency, 
and it's a frée particle in the z direction. The allowed energies, (m4 + Shon, 
are called Landau Levels.*° 


* *Problem 4.61 [Refer to Problem 4.59 for background.] In classical electrodynam- 
ics the potentials A and @ are not uniquely determined;*/ the physical quantities. 
are the fields, E and B. 


46 For further discussion see Ballentine (footnote 26) Section 11.3. 
47See, for example, Griffiths (footnote 30) Section 10.1.2. 
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(a) Show that the potentials = - 


OA 
€ 29-75. A =A+VA [4.210] 
(where A is an arbitrary real function of position and time) yield the same 
fields as o and A. Equation 4.210 is called a gauge transformation, and the 
theory is said to be gauge invariant. 


(b) In quantum mechanics the: potentials play a more direct role, and it is of 
interest to know whether the theory remains gauge.invariant. Show that 


Wi = cidAliy [4211] 


sátisfies the Schródinger equation (4.205) with the gauge-transformed poteri- 
tials g and A’. Since W’ differs from Y only by a phase factor, it represents 
the same physical state, and the theory is gauge invariant (see Section 
10.2.3 for further discussion). 


48 That is to say, (r), d(r)/dt, etc. are unchanged. Because A depends on position, (p) (with p 
represented by the operator (A/i)V) does change, but as we found in Equation 4.206, p does not 
represent the mechanical inomentum (mv) in this context (in lagrangian mechanics il is so-called 
canonical momentum), 


CHAPTER 5 


IDENTICAL PARTICLES 


5.1 TWO-PARTICLE SYSTEMS 


For a single particle, W(r, f) is a function of the spatial coordinates, r, and the 
time, ¢ (we'll ignore spin, for the moment). The state of a two-particle system is 
a function of the coordinates of particle one (r1), the coordinates of particle two 
(r2), and the time: 


W(r1, r2, £f). [5.1] 
Its time evolution is determined (as always) by the Schrödinger equation: 
ow 
iA— = HW, 5.2 
n [5.2] 


whete. H is the Hamiltonian for the whole system: 


| at. i eec. P | 
H= 2m] Vi 2m2 V F y(r, r2, t) [5.3] 


(the subscript on V indicates differentiation with respect to the coordinates of 
particle 1 or particle 2, as the case may be), The statistical interpretation carries 
over in the obvious way: 


[Y (r1, r2, £)? dri d?r; [5.4] 


is the probability of finding particle 1 in the volume d^r, and particle 2 in the 
volume d°r2; evidently V must be normalized in such a way that 


J Iri, r2, (7 dr, Pr = 1. [5.5] 
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For time-independent potentials, we obtain a complete set of solutions by 
separation of variables: 


V (r1, 72, f) = rrj, raje 5/5, [5.6] 
where the spatial wave function (V^) satisfies the time-independent Schrödinger 
equation: 

n i? _, | 

am, VIV 7 au Vow Vr = EV, [5.7] 


and E is the total energy of the system. 


x Problem 5.1 Typically, the interaction potential depends only on the vector r = 
rj — r? between the two particles. In that case the Schrödinger equation separates, 
if we change variables from rj, rz to r and R = (mir, + mara)/(m, + mz) (the 
center of mass). 


(a) Show that r; = R + (u/mi)r, r; = R — (u/ma)r, and Vi = (u/ma)Vmg + 
Vr, V2 = (uf/mi)Vg — V;, where 
mma 
mi + m» 


h [5.8] 


is the reduced mass of the system. 


(b) Show that the (time-independent) Schródinger equation becomes 


n? 


~ 3m mp) vi - 2. LN Ve = Ew. 


(c) Separate the variables, letting y (R, r) = Vg (Ry; (r). Note that Yg satisfies 
the one-particle Schródinger equation, with the tota! mass (m; + mz) in 
place of m, potential zero, and energy Eg, while Y, satisfies the one-particle 
Schródinger equation with the reduced mass in place of in, potential V(r), 
and energy E. The total energy is the sum: E = Eg + Er. What this tells us 
is that the center of mass moves. like a free particle, and the relative motion 
(that is, the motion of particle 2 with respect to particle 1) is the same as if 
"we had à single particle with the reduced. mass, subject to the Poen V. 
Exactly the same decomposition occurs. in classical mechanics; it reduces 
the two-body problem to an equivalent one-body próblem. 


1 See, for example, Jerry B. Marion and Stephen T. Thornton, Classical Dynamics of Particles 
and Systéms, 4th ed., Saunders, Fort Worth, TX (1995), Section 8.2. 
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Problem 5.2 In view of Problem 5.1, we can correct for the motion of the nucleus 
in hydrogen by simply replacing the electron mass with the reduced mass. 


(a) Find (to two significant digits) the percent error in the binding energy of 
hydrogen (Equation 4.77) introduced by our use of m instead of m.. 


(b) Find the separation in wavelength between the red Balmer lines (n — 3 — 
n = 2) for hydrogen and deuterium. 

(c) Find the binding energy of positronium (in which the proton is replaced by 
a positron— positrons have the same mass as electrons, but opposite charge). 


(d) Suppose you wanted to confirm the existence of muonic hydrogen, in which 
the electron is replaced by a. muon (same. charge, but 206.77 times Heavies): 
Where. (i.e., at what wavelength) would you look for the “Lyman-a’ line 
(m=2—> n= 1)? 


Problem 5.3 Chlorine has two naturally occurring isotopes, CI? and CI?7. Show 
that the vibrational spectrum of HC] should consist of closely spaced doublets, with 
a splitting given by Av = 7.51 x 107^», where v is the frequency of the emitted 
photon. Hint: Think of it. as a harmonic oscillator, with œ =../k/j, where. p is 
the reduced mass (Equation 5.8) and ķ is presumably the same for both isotopes. 


5.1.1 Bosons and Fermions 


Suppose particle 1 is in the (one-particle) state ya (r), and particle 2 is in the state 
wp(r). (Remember: I’m ignoring spin, for the moment.) In that case y (r1, r2) is a 
simple product:? 


V(ri, r2) = Va (ri) V» (r2). [5.9] 


Of course, this assumes that. we can tell the particles apárt— otherwise it wouldn't 
make any sense to claim that number 1 is in state w, and number 2 is in state pp; 
all we could say is that one of them is in the state V; and the other is in state pp, 
but we wouldn't know which is which. If we were.talking classical mechanics this 
would be a silly objection: You can always tell the particles apart, in principle —just 


2k is emphatically not true. that every two-particle wave function is a product of two one-particle 
wave functions. There exist so-called entangled states that cannot be decomposed this way. However: 
If particle 1 is in state a and particle 2 is in state b, then the two-particle state is a product. I know 
what you're thinking: “How could particle 1 not. be in.some state, and particle 2 in some other state?” 
The classic example is the singlet: spin configuration (Equation 4.178)— I ‘can’t tell you the state of 
particle 1 by itself; because it is “entangled” (Schrédinger’s lovely word) with the state of particle 2. 
If 2 is measured, and found to be spin up, then 1 is spin down, but if 2 is spin dowri, then 1 is spin up. 
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paint one of them red and the other one blué, or stamp identification numbers on 
them, or hire private detectives to follow them around: But in quantum mechanics 
the situation is fundamentally different: You can’t paint an electron red, or pin a 
label on it, and a detective’s observations will inevitably and unpredictably alter 
its state, raising doubts as to whether the two had perhaps switched places. The 
fact is, all electrons are utterly identical, in a way that no two classical objects can 
ever be. It’s not just that we don’t happen to know which electron is which; God 
doesn't know which.is which, because there is no such thing as “this” electron, or 
"that" electron; all we can legitimately speak about is “ari” electron. 

Quantum mechanics neatly accommodates the existence of particles that are 
indistinguishable in principle: We simply construct a wave function that is non- 
committal às to which particle is in which state. There are actually two ways to 
do it: 

V (r1, 72) = Alwar) Vs (r2) E Vo Gri) Va (x2)]. [5.10] 


Thus the theory admits two kinds of identical particles: bosons, for which we use 


‘the plus sign, and fermions, for which we use the minus sign. Photons arid mesons 
are. bosons; protons and electrons are fermions. It so happens that 


| all particles with integer spin are bosons, and [5.11] 


all particles with half integer spin are fermions. 


This connection between spin and statistics (as we shall see, bosons and fermions. 
have quite: different statistical properties) can be proved in rélativistic quantum 
mechanics; in the nonrelativistic theory it is taken as an axiom.? 

It follows, in particular, that. two identical fermions (for example, two elec- 
trons) cannot occupy the same state. For if Ya = Wy, then 


V- (1. r2) = Alvar) Vra (r2) — vat s 2)] = 0, 


and we are left.with no wave function at all.4 This is the famous Pauli exclusion 
principle. It is not (as you may have.been led to believe) a weird ad hoc assumption 
applying only to electrons, but rather a consequence of the rules for constructing 
two-particle wave functions, applying to all identical fermions. 

I assumed, for the sake of argument, that one particle was in the state Wa 
and the other in state. yj, but there is a more general (and more. sophisticated) 


It seems bizarre that relativity should have anything to do with it, arid there has. been a lot 
of discussion recently as to whether it might be possible to prové the spiri-statistics connection. in 
other (simpler) ways. Sce, for example, Robert C. Hilborn, Am. J. Phys. 63, 298 (1995); Tan Duck and 
E. C. G. Sudarshan, Pauli and the Spin-Statistics Theorem, World Scientific, Singapore (1997). 

^T'm still leaving out the spin, don't fotgct—if this bothers you (alter all, a spinless fermion 
is a contradiction in terms), assume they're in the same. spin state. I'll incorporate spin explicitly in a 
moment. 


m ene 
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way to formulate the problem. Let us define the exchange operator, P, which 
interchanges the two particles: 


Pf(ri, v2) = f (12, r1). [5.12] 


Clearly, P? — |, and it follows (prove it for yourself) that the eigenvalues of P are 
+ 1. Now, if the two particles are identical, the. Hamiltonian must treat them the 
same: mj = m» and V (rj, t2) = V (ro, ri). It follows that P and H are compatible 
observables, 

LP 47] 0. [5.13] 


and hence we can find a complete set of functions that are simultaneous eigenstates 
of both. That is to say, we can find solutions to the Schródinger equation that 
are either symmetric (eigenvalue +1) or antisymmetric (eigenvalue —1) under 
exchange: 


V(ri,r2) = XV», r1). [5.14] 


Moreover, if a system starts out in such a state, it will remain in such a state. The 
new law (I'll call itthe symmetrization requirement) is that for identical particles 
the wave function is not merely allowed, but required to satisfy Equation 5.14, 
with the plus sign for bosons, and the minus sign for fermions.? This is the general 
statement, of which Equation 5.10 is a special case. 


Example 5.1 Suppose we have two noninteracting—they pass right through one 
another ... never mind how you would set this up in practice! — particles, both of 
mass m, in the infinite square well (Section 2.2). The one-particle states are 


(where K = nh / 2ma?, for convenience), If the particles are distinguishable, with 
#1 in state ny and #2 in state n2, the composite wave function is a simple product: 


Vnin; (x1. X2) = Vin, 1) Wing G2). Enin = (nj + n3)K . 


“It is sometimes stiggested that the symmetrization requirement (Equation 5.14) is forced by 
the fact that P and H commute. This. is false: It is perfectly possiblé.to imagine a systém öf two 
distinguishable particles (say, an eléctron and'a positrón) for which the: Hamiltonian is symmetric, and 
yet there is no requirement that the: wave function be symmetric (or antisymmetric). But identical 
particles have to. occupy symmetric or antisymmetric states, and this is a completely new fundamental 
law —on a par, logically, with Schródinger's equation and the statistical interpretation. Of course, there 
didn’t. have to be any such things as identical particles; it could have been that every single particle 
in nature was distinguishable from. every other one. Quantum mechanics allows for the possibility of 
ideritical particles, and nature (being lazy) seized the opportunity. (But I'm tot. complaining—this 
makes matters enormously simpler!) ` l 
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For example, the ground state is + 
2 
yi = —sinGrxj/a)sin(zxo/a), Ey, = 2K; 
a 
the first excited state is doubly. degenerate: 


2 : , 
Yi = " sin(xxi/a)sin(2zx?/a), Eq, =5K, 
ü ! 


v m 
Wil =a sin(Zz xy /a)sin{zx2/a), | Ey =5K; 


and. so on. If the two particles are identical bosons, the ground state is unchanged, 
but the first excited state is nondegenerate: 


X2 
oe) [sin(zr x1 /a) sinr x5/a) + sin2ztx, /a) sin(x x2/a)] 
ü / 
(still with energy 5K). And if the particles are identical fermions, there is no state 
with energy 2K; the ground state is 
= [sinGrx1 /a) sin(2zt x2/a) — sin(2z x1 /a) sin(x2/a)|, 


and its energy is 5K. 


xProblem 5.4 
(a) If Ya and yp are orthogonal, and both normalized, what is the constant A in 
Equation 5.10? 


(b) If Ya = Yh (and it is normalized), what is A? (This case, of course, occurs 
only for bosons.). 


Problem 5.5 


(a) Write down the Hamiltonian for two noninteracting identical particles in the 
infinite square well. Verify that the fermion ground state given in Example 5.1 
is an eigenfunction of H, with the appropriate eigenvalue. 

(b) Find the next two excited states (beyond the ones in Example 5.1)— wave 
functions and energies—for each of the three cases (distinguishable, identical 
bosons, identical fermions). 
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5.1.2 Exchange Forces 

To give you some sense of what the symmetrization requirement actually does, I’m 
going to work out a simple one-dimensional example. Suppose one particle is in 
state: Va (x), and the other is in state v5 (x), and these two states are orthogonal 
and normalized. If the two particles are distinguishable, and number 1 is the one 
in. state 3/4, then the combined wave function is 


Vr Ga, x2) = Va) Ve 2); [5.15] 


if they are identical bosons, the.composite wave function is (see Problem 5.4 for 
the. normalization) 


Va. x2) = lta ed vox) + op (x1) Ya (%2)]; [5.16] 


and if they are identical fermions, it is 


V- Gu. x2) = nh (x1) Va G2) — Vs Ga) Va Go2)]. [5.17] 


Let's calculate the expectation value of the square of thé. separation distance: 
between the two particles, 


(Qi — 22") = (x1) + (x2) — 2032). [5.18] 
Case 1: Distinguishable particles. For the wavefunction in Equation 5.15, 
(x1) = J dWaGi) dxi | iro GI" dx = (x^)a 
(the expectation value of x? in the one-particle state Ya), | 
03) feo as f diveca ass = n. 


and 


61) = J xilva (xy) |? dx J xalVz Ga)l dxa = 8)a(x)p. 
In this case, then, 
(9x — x2)534 = (x7) + ys — 2095005. [5.19] 


(Incidentally, the answer would, of course, be the same if particle 1 had been in 
state 3/5, and particle 2 in state W.) 
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Case 2: Identical particles. For the wave functions in Equations 5.16 and 5.17, 
(xt) = ; | J xia (a) P dx J Wea)? dx2 
+ f xi lyp GP dxi | IVa (xa)? dx 
+ [stis cote n f e a n do 
T J xiy J Va Ga) dxi | Va (x2)* Wp (x2) 24 
= ; [a +2), bot o] = ; (e $ Gy.) 


Similarly, 


(xz) = ; (6, + uh) 


(Naturally, (x2) = (x2), since you can't tell them apart.) But 
(x2) = Lf x1 a GP dar | xal GP dxa 
+ J xilia Ga)? dxi | x2 [Wa (x2)? dx; 
= f Xi palsi) Wp (x1) dxi | X2 Vy (X2)" Vra (x2) dx 
t J oio Go Va Q0) dx | xia Go ro Q) dix | 


1,. l l . l 
= z(a l)a  G)s(x)a E (Xab (x) ba T (x) ba (X)ab) 


= (x). (hs + |e) abl’, 


where ; 
(x)gp = J x Wa GO Wn (x) dx. [5.20] 
Evidently 
(Gi. — xiy)4. = Ua + G2) — 204002 T 2| 04417. [5.21] 


Comparing Equations 5.19 and 5.21, we see that the difference resides in the 
final term: 
(A295) 4 = (Ax)! x 21 (x)an/?. [5.22] 
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F + T F ae 


(b) 


FIGURE 5.1: Schematic picture of the covalent bond: (a) Symmetric configuration 
produces attractive force. (b) Antisymmetric configuration produces repulsive force. 


Identical bosons (the upper signs) tend to be somewhat closer together, and identical 
fermions (the lower signs) somewhat farther apart, than distinguishable particles 
in the same two states. Notice that (x)ap vanishes unless the two wave func- 
tions actually overlap [if (x) is zero wherever W(x) is nonzero, the integral 
in Equation 5.20 is zero]. So if Ya represents an electron in an atom in Chicago, 
and V represents: an electron in an atom in Seattle, it’s not going to make any 
difference whether you antisymmetrize the wave function or not. As a practical 
matter, therefore, it’s okay to pretend that electrons with nonoverlapping wave 
functions. aré distinguishable. (Indeed, this is the only thing that allows physicists 
and chemists to proceed at all, for in principle every electron in the. universe is 
linked. to every other one, via the antisymmetrization of their wave functions, and 
if this really mattered, you wouldn't be able to talk about anyone unless you were 
prepared to deal with them ail!) 

The: interesting case is when there is some overlap of the wave functions. 
The system behaves as though there were a “force of attraction" between identical 
bosons, pulling them closer together, and a "force of repulsion" between identi- 
cal fermions, pushing them apart (remember that we are for the moment ignoring 
spin). We call it an exchange force, although it’s not really a force at all—no 
physical agency is pushing on the particles; rather, it is a purely geometrical conse- 
quence of the symmetrization requirement. It is also a strictly quantum mechanical 
phenomenon, with no classical counterpart. Nevertheless, it. has profound consé- 
quences. Consider, for example, the hydrogen molecule (H5). Roughly speaking, 
‘the ground state consists of one electron in the atomic ground state (Equation 4.80) 
centered on nucleus 1, and one. electron in the atomic ground state: centered at 
nucleus 2. If electrons were bosons, the symmetrization requirement (or, if you 
like, the “exchange force") would tend to concentrate thé electrons toward the 
middle, between the two protons. (Figure 5.1(a)), and the resulting accumulation 
of negative charge would attract the protons inward, accounting for the covalent 
bond.® Unfortunately, electrons aren't bosons, they're. fermions, and this means 
that ‘the concentration of negative charge should actually be shifted to the wings 
(Figure 5.1(b)), tearing the molecule apart! 


6A covalent bond occurs when shared electrons congregate betwecn the nuclei, pulling the atoms 
together, It need not involve, two: electrons—in Section 7.3 we'll encounter a. covalent bond with just 
one electron. 
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5.2 ATOMS 


But wait! We have been ignoring spin. The complete state of the electron 
includes not only its position wave function, but also a spinor, describing the 
orientation of its spin: 


V (x (s). [5.23] 


When we put together the two-electron state, it is the whole works, not just the 
spatial part, that has to be antisymmetiic with respect to exchange. Now, a glance 
back at the composite. spin states (Equations 4.177 and 4.178) reveals that the singlet 
combination is antisymmetric (and hence would have to be joined with a symmetric 
spatial function), whereas the three triplet states are all symmetric (and would require 
an antisymmetric spatial function). Evidently, then, the singlet state should. lead to 
bonding, and the triplet to anti bonding. Sure enou gh, the chemists tell us that covalent 
bonding requires the two electrons to occupy the singlet state, with total spin zero.? 


xProblem 5.6 Imagine two noninteracting particles, each of mass m, in the infinite 
.square well. If one. is in the state ,, (Equation 2.28), and the other in. state yy 
(| x m), calculate (xi — x2)?), assuming (a) they are distinguishable particles, 
(b) they are identical bosons, and (c) they are identical fermions. 


Problem 5.7 Suppose you had three. particles, one in state. V, (x), one in state 
V5 (x), and one in state Vr. (x). Assuming Wa, Wp, and v; are orthonormal, construct 
the three-particle states (analogous to Equations 5.15, 5.16, and 5.17) representing 
(a) distinguishable particles, (b) identical bosons, and (c) identical fermions. Keep 
in mind that (b) must be completely symmetric, under interchange of any pair of 
particles, and (c) must be completely antisymmetric, in the same sense. Comment: 
There's a cute trick for constructing completely antisymmetric wave functions: 
Form the Slater déterminant, whose first row is. a (41), We), VeGxi), etc, 
whose second row is Va(x2), Ws(x2), Vr; (x2), etc., and so on (this device works 
for any number of particles). 


A neutral atom, of atomic number Z, consists of a heavy nucleus, with electric: 
charge Ze, surrounded by Z. electrons (mass m and charge —e). The Hamiltonian 


Tin the absence of coupling between spin and position, we are free to assume that the. state is- 
‘separable in its spin and spatial coordinates. This just says that the probability of getting spin up. is. 
independent of the location of the particle. In the. presence of coupling, the general state would take 
the form of a linear combination: ¥+(r)x+ + V (r) x—, as in Problem 4. 55. 


5In castal language, it i$ often said that the. electrons are “oppositely aligned” (one with spin 
up, and the other with spin down). This is something of an oversimplification, since. the. same could be: 
said of the: m = 0 triplet state. The precise statement is that they are. in the singlet configuration. 
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for this system is? 
Z f :2 ; \ 2 / y Z 2 
n" s "d LZ lf 1 
#= > -5v (s IESEH (Ge) t. B2] 
p 2m \4reg] rj 2 \47e9 / ir Ir; — rz] 


The term in curly brackets represents the. kinetic plus potential energy of the jth 
electron, in the electric field of the nucleus; the second sum (which runs over 
all values of j and k except j = k) is the potential energy associated with the 
mutual repulsion of the. electrons (the factor of 1/2 in froüt corrects for the fact 
that the summation counts each pair twice). The problem is to solve Schródinger's 
equation, 

Hy = Ey, [5.25] 


for the. wave function V (r1, P2, ... , rz). Because electrons are identical fermions, 
however, not all solütions are acceptable: only those for which the complete state 
(position and spin), 


Vi, T2; Eira y rz)X(si. 82, ana) sz); [5.26] 


ïs antisymmetric with respect to interchange of any two electrons. In particular, no 
two electrons can occupy the same state. 

Unfortunately, the Schródinger equation with Hamiltonian in Equation 5.24 
cannot: be solved exactly (at any rate, it hasn't been), except for the very simplest 
case, Z = 1 (hydrogen). In practice, one must resort to elaborate approximation 
methods. Some of these we shall explore in Part IL; for now I plan only to sketch 
some qualitative features of the solutions, obtained by neglecting the. electron repul- 
sion term altogether. In Section 5.2.1 we'll study the. ground state and excited states 
of helium, and in Section 5.2.2 we'll examine the ground states of higher atoms. 


Problem 5.8 Suppose you could find a solution (Y (r1, r2, ... , rz)) to the Schró- 
dinger equation (Equation 5.25), for the Hamiltonian in Equation 5.24. Describe. 
how you would construct from it a completely symmetric function and.a completely 
antisymmetric function, which also satisfy the Schródinger equation, with the same 
energy. 


?rm assuming the nucleus is stationary. Thé trick of accounting for nuclear motion’ by using the 
réducéd mass (Problem 5.1) works only for the two- -body problem; fortunately, the nucleus is so much 
inore massive than the electrons that the correction is extremely small even in the. case of hydrogen 
(see Problem 5.2(a)), and it is smaller still for the heavier atoms. There are. more. interesting effects, 
duc to magnetic interactions associated with electron spin, relativistic.corrections, and the finite size of 
the nueleus. We'll look into these in later chapters, but all of them are, minute corrections to the "purely 
coulombic” atom described by Equation 5.24. 


E 
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5.2.1 Helium v 


After hydrogen, the simplest. atom is helium (Z = 2). The Hamiltonian, 


: HR o, 1 22 h2 1 2e 1 e | 
H=} —— Vy ——— dad y2 ——Á— ST 
| 2m | 4zteg r| E 2m ? 4meo ra JT res Ir; — í| l | 


consists of two hydrogenic Hamiltonians (with nuclear charge 2e), one for electron 
1 and one for electron 2, together with a final term describing the repulsion of the 
two electrons. It is this last term that causes all the trouble. If we simply ignore it, 
the Schródinger equation separates, and the solutions can be. written as products of 
hydrogen wave functions: 


(tt, £2) = Vim (rer (r2), [5.28] 


only with half the Bohr radius (Equation 4:72), and four times the Bohr energies 
(Equation 4.70)—if you don’t see why, refer back to Problem 4.16. The total 
energy would be 


where E, — —13.6/n? eV. In particular, the ground state would be 


vo! ;T2) = "ioo(ri1)Vrioo(r2) — aie Xr *r2)/a [5.30] 
(see Equation 4.80), and its energy would be 
Eo = 8(—13.6 eV) = —109 eV. [5.31] 


Because vj is a symmetric function, the spin state has to be antisymmetric, so the. 
ground state of helium should be a. singlet configuration, with the spins “oppo- 
sitely aligned." The actual ground state of helium is indeed a singlet, but the i 
experimentally determined energy is —78.975 eV, so the agreement is not very. 
good. But this is hardly surprising: We ignored electron repulsion, which is cer- 
tainly not a small contribution. It is clearly positive (see Equation 5.27), which is 
comforting —evidently it brings the total energy up from —109 to —79 eV (see 
Problem 5.11). 

The excited states of helium consist of one electron in the hydrogenic ground 
státe, and thé other in an excited state: 


Whim Wioo : [5 332] 


[If you try to put both electrons in excited states, one immediately drops to the 
ground state, releasing enough energy to knock the other one into the contin- 
uum (E > 0), leaving you with a helium ion (Het) and a free electron. This 
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is an interesting system in its own right—see Problem 5.9—but it is not our 
present concern.] We can construct both symmetric. and antisymmetric combina- 
tions, in the usual way (Equation 5.10); the former go with the antisymmetric spin 
configuration (the singlet), and they are called parahelium, while the latter require 
à symmetric spin configuration (the triplet), and they aré known as orthohelium, 
The ground state is necessarily parahelium; the éxcited states come in both forms. 
Because the symmetric spatial state brings the electrons closer together (as we dis- 
covered in Section 5.1.2), we expect a higher interaction energy in parahelium, and 
indeed, it is experimentally confirmed that the parabelium states have somewhat 
higher energy than their orthohelium counterparts (see Figure 5.2). 


Parahelium Orthohelium 
is P ip F 38 P D F 
0 : 
aal 4P+ 4DF 4F 4P- 
E 4S H 48 4 4D+ 4F 
SPT 3D 3D 
354 3P 
, 3S + 
2. 
> 
o 
5 -3 
$ l 
i 2P 
2P 
E 2S 
2S 
(1S.at -24.5 eV) 


FIGURE 5.2: Energy level diagram for helium (the notation is explained in 
Section 5.2.2). Note that parahelium energies. are uniformly higher than their orthohe- 
lium counterparts. The numerical values on the vertical scale are relative to the ground 
state of ionized helium (He*): 4 x (—13.6) eV = —54.4 eV; to get the total energy of 
the state, subtract 54.4 eV.. 


214 Chapter $ Identical Particles 


Problem 5.9 
(a) Suppose you put both electrons in a helium atom into the n = 2 state; what 
would the energy of the emitted electron be? 


(b) Describe (quantitatively) the spectrum of the helium ion, Het, 


Problem 5.10 Discuss. (qualitatively) the energy level scheme for helium if 
(a) electrons were identical bosons, and (b) if electrons were distinguishable parti- 
cles (but with the. same mass and charge). Pretend these “electrons” still have spin 
1/2, so the spin configurations are the singlet and the triplet. 


s-Problem 5.11 


(a) Calculate ((1/|ri — r2|)) for the state yo (Equation 5.30). Hint: Do the d?r; 
integral first, using spherical coordinates, and setting the polar axis along r1, 


so that 
Ir; —Y2| = Sri T r2 — 2rır cos 65. 


The 6 integral is easy, but be careful to take the positive root. You'll have 
to break the rz intégral into two pieces, one ranging from 0 to r1, the other 
from rı to co. Answer: 5/4a. 


(b) Use your result in (a) to estimate the electron interaction energy iü the 
ground state of helium. Express your answer in electron volts, and add it 
to Eg (Equation 5.31) to get a corrected estimate of the ground state energy. 
Compáre the experimental value. (Of course, we're still working with an 
approximate wave function, so don't expect perfect agreement.) 


5.2.2 The Periodic Table 


The ground state electron configurations for heavier atoms can be pieced together in 
much the same way. To first approximation (ignoring their mutual repulsion alto- 
gether), the. individual electrons occupy one-particle hydrogenic states (m, l, m), 

called orbitals, in the Coulomb potential of a nucleus with charge Ze. If elec- 
trons were bosons (or distinguishable particles) they would all shake down to the 
ground state (1, 0, 0), and chemistry would be very dull indeed. But electrons are 
in fact identical fermions, subject to the Pauli exclusion principle, so only two 
can occupy any given orbital (one. with spin up, and one with spin down—or, 

more precisely, in the singlet configuration). There are n? hydrogenic wave func- 
tions (all with the same energy En) for a given value of n, so the n = I shell 
has room for 2 electrons, the n = 2 shell holds 8, n = 3 takes 18, and in gen- 
eral the nth shell can accommodate 275? electrons. Qualitatively, the horizontal 
rows on the Periodic Table correspond to filling out éach shell (if this were the: 
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whole. story, they would have lengths 2, 8, 18, 32, 50, etc., instead of 2, 8, 8, 
18, 18, etc.; we'll see in a moment how the electron-electron repulsian throws the 
counting off). 

With helium, the n = 1 shell is filled, so the next atom, lithium (Z = 3), has 
to. put one electron into the: n = 2 shell. Now, for n = 2 we can have / = 0 or 
i = 1; which of these will the third electron choose? In the absence of electron- 
electron interactions, they both have the same energy (the Bohr energies. depend 
on i1, remember, but not on /). But the effect of electron repulsion is to favor the 
lowest value of /, for the following reason. Angular momentum tends to throw 
the electron outward, and the farther out it gets, the more effectively the inner 
electrons sereen the nucleus (roughly speaking, the innermost electron "sees" the 
full nuclear charge. Ze, but the outermost electron sees an effective charge hardly 
greater than e). Within a given shell, therefore, the state with lowest energy (which 
is to say, the most tightly bound electron) is Z = 0, and the energy increases with 
increasing /. Thus the third electron in lithium occupies the orbital (2, 0, 0). The 
next atom (beryllium, with Z — 4) also fits into this state (only with "opposite 
spin"), but boron (Z — 5) has to make use of / — 1. 

Continuing in this way, we reach neon (Z = 10), at which point the n = 2 
shell is filled, and we advance to the next row of the periodic table and begin to 
populate the. n = 3 shell. First there are two atoms (sodium and magnesium) with 
| = 0, and then there are six with / = 1 (aluminum through argon). Following 
argon there “should” be 10 atoms with n = 3 and / = 2; however, by this time the 
‘screening effect is so strong that it overlaps the next shell, so potassium (Z — 19) 
and calcium (Z = 20) choose n = 4, | = 0, in preference to n = 3, 1 = 2. After 
that we drop back to pick. up the n. = 3, | = 2 stragglers (scandium through zinc), 
followed by n = 4, 1 = 1 (gallium through krypton), at which point we again make 
a premature jump to the next row (n = 5), and wait until later to slip in the / = 2 
and | = 3 orbitals from the n = 4 shell. For details of this intricate counterpoint I 
refer you to ahy book on atomic physics. i0 i 

I would be delinquent if I failed to mention the archaic nomenclature for 
atomic states, because all chemists and most physicists. use it (and the people who 
make up the Graduate Record Exam løve this kind of thing). For reasons known 
best to nineteenth century spectroscopists, / = 0 is called s (for sharp"), | = 1 is p 
(for principal"), / = 2 is d (“diffuse”), and / = 3 is f ("fundamental"); after that I 
guess they ran out of imagination, because it now continues alphabetically (e, A, i, 
but skip j —just to be utterly perverse, k, Z, etc.).! The state of a particular electron 
is represented by the pair ni, with n (the. number) giving the shell, and / (the letter) 


~ 


See, for examplé, U. Fano arid L. Faro, Basic Physics of Atoms and Molecules, Wiley, New 
York (1959), Chapter 18, or the-classic by G; Herzberg, Atomi¢ Spectra and Atomic Structure, Dover, 
New York (1944). l 

| The shells themselves are assigned equally arbitrary. nicknames, starling (don't ask inc why) 
wilh K: The K shell is æ = 1, the Z shell is n = 2, M is # = 3, and so on (at least they're in 
alphabetical order). 
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specifying the orbital angular momentum; the magnetic: quantum number ri is not 
listed, but an exponent is used to indicate the number of electrons that occupy the 
state in question. Thus the configuration 


(152 (2s)? (2p)? [5.33] 


tells us that there are two electrons in the orbital (1, 0. 0), two in the orbital (2, 0, 0), 
and two in some combination of the orbitals (2, 1, 1), (2, 1, 0), and (2, 1, —1). This 
happens to be the ground state of carbon. . 

In that example there are two electrons with orbital angular momentum quan- 
tum number 1, so the total orbital angular momentum quantum number, L (capital 
L, instead of Z, to indicate that this pertains to the total, not to any one particle) 
could be 2, 1, or 0. Meanwhile, the two (1s) electrons are locked together in the 
singlet state, with total spin zero, and so are the two (2s) electrons, but the two 
(2p) electrons could be in the singlet configuration or the triplet configuration. So 
the total spin quantum number S (capital, again, because it's the total) could be 1 
-or 0, Evidently the grand total (orbital plus spin), J, could be 3; 2, 1, or 0. There 
exist rituals, known as Hund’s Rules. (see. Problem 5.13) for figuring out what 
these totals will be, for a particular atom. The result is recorded as the following 
hieroglyphic: 

28+1 Ly, [5.34] 


(where S and J are the numbers, and L the letter—capitalized, this time, because 
we're. talking about the totals). The ground state of carbon happens to be ? P5: 
the total spin. is 1 (hence the 3), the total orbital angular miomentum is 1 (hence 
the P), and the grand total angular momentum is zero (hence the 0). In Table 5.1 
the individual configurations and the total angular momenta (in the notation of 
Equation 5.34) are listed, for the first four rows of the Periodic Table.!? 


xProblem 5.12 


(a) Figure out the electron configurations (in the notation of Equation 5.33) for 
the first two rows of the Periodic Table (up to neon), and check your results 
against Table 5.1. 

(b) Figure out the corresponding total angular momenta, in the notation of Equa- 


tion 5.34, for the first four elements. List all the possibilities for boron, car- 
bon, and nitrogen. 


12 After krypton—célemerit 36— the: situation gets.more complicated (fine sttueture starts: to play 
a significant role in ihe ordering of the states) so it is not for want of space that the table terminates 
there. 
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TABLE 5.1: Ground state electron configu- 
rations for the first four rows of the Periodic 
Table. 


N 


Element Configuration 

H (1s) 7$yo 
He (is)? Ils, 0 
Li (He)(2s) ?$15 
Be (He)(2s)* ISo 
B (He)(2s)°(2p) "Pin 
C (He)(2s)°(2p)? 3Pg 
N (He)Qs)Qp)* ^S. 
O (He)(2s)?(2p)* 3P, 
3 

N 

N 


OO NA uja Why = 


(He)(2s°2p)° Psp 
Ne (He)(2s)*(2p)® | 
a (Ne)(35) 
12 Mg — (NoOGs? , 
13 Al (Ne)GsY?(3p) Pin 
14 Si (Ne)(3s)2(3p)* 3Py 
15 P (Ne)Gs)?(3p)* ^$3/ 
16 S (Ne)(3s)*Gp)* 3p, 
(Ne)(3s) (3p? l 
8 (Ne)Gs)°3p)* 
19 K (Ary4s) ?$15 
20 Ca (Ar)(4s)? 150 
21 Sc (Ar 45)? (34) ?2Dap, 
22 Ti (AD(4sP (34 )* 3F, 
23 V (An (45)! Gd P “Fp 
24 Cr (Ar)(4s)(3d P 7$; 
25 Mn (Arn 4s Gd P 68/5 
26 Fe (Ar 4s)*(3d 6 5p, 
27 Co (ADAP BAY yl 
28 Ni (Ar (4s)? 3d 8 Fa 
29 Cu (Ar)(4s)(3d)! 2512 
30 Zn (Ar 45)? (34)! 'So 
31 Ga (Ar)(4s)2(3d)!$(4p)  ?Pjp 
32 Ge (Ar)(4s°(3d) (4p)? 3 Pa 
33 As — (AD(4s yd) 9p “San 
34 Se (Ar)(4s)2(3d)%(4p)4 3P 
35 Br (An(4sGd) (4p?  ?P35 
36 Kr — (Ans?Gd) 4p tSo 
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x Problem 5.13 


(a) Hund's first rule says that, consistent with the Pauli principle, the state with 
the highest total spin (S) will have the lowest energy. What would this predict 
in the case of the excited states. of helium? 


(b 


— 


Hund's second rule says that, for a. given spin, the state with the highest total 
orbital angular momentum (L), consistent with overall antisymmietrization, 
will have. the lowest energy. Why doesn't carbon have L = 2? Hint: Note 
that the "top of the ladder" (Mr, — L) is symmetric. 


Hund's third rule says that if a subshell (5, /) is no more than half filled, 
then the lowest.energy level has J = |L —S|;.if itis more than half filled, then 
J = L + S has the lowest energy. Use this. to resolve the boron ambiguity 
in Problem 5.12(b). 


a 


| qd 


— 


Use Hund’s rules, together with the fact that a symmetric spin state must go 
with an antisymmetric position state. (and vice versa) to resolve the: carbon 
and nitrogen ambiguities in Problem 5.12(b). Hint: Always go to the “top of 
the ladder" to figure out the symmetry of a state. | 


Problem 5.14 The ground state of dysprosium (element. 66, in the 6th row of the 
Periodic Table) is listed as ? /g. What are the total spin, total orbital, and grand total 
angular momentum quantum numbers? Suggest a likely electron configuration for 
dysprosium. 


5.3 SOLIDS 


In the solid state, a few of the loosely. bound outermost valence electrons in each 
atom become detached, and roam around throughout the material, no longer subject 
only to the.Coulomb field of a specific *parent" nucleus, but.rather to the combined 
potential of the entire crystal lattice. In this section we will examine two extremely 
primitive models: first, the electron gas theory of Sommerfeld, which ignores. all 
forces (except the confining boündaries), treating the wandering electrons as free 
particles in a box (the three-dimensional analog to an infinite square well); and 
second, Bloch's theory, which introduces a periodic potential representing the elec- 
trical attraction of the regularly spaced, positively charged, nuclei (but still ignores 
électron-electron repulsion). These models are no more than the first halting steps 
towárd a quantumi theory of solids, but already they reveal the critical role of 
‘the Pauli exclusion principle in accounting for “solidity,” and provide illuminating 4 
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insight into the remarkable electrical properties of conductors, semi-conductors, 
and insulators. 

5.3.1 The Free Electron Gas 

Suppose the object in question is a rectangular solid, with dimensions. ly, ly, be, 
and imagine that an electron inside experiences no forces at all, except at the 
impenetrable walls: 


0, if0<x<k, 0«y«lyand 0 <z <k; 


oo, otherwise. [5.35] 


V(x,y,z2)-— | 


The Schródinger equation, 
h2 
—— Vw = Ey, 
2m. 


separates, in cartesian coordinates: w(x, y, z) = X Q)Y () ZG), with 


h? dX 0 BY ,. h? dz 
->a Bek yg BY; -aoga = Eee 
2m dx? 2m dy . 2m dz? 


and E = Ex + Ey + E;. Letting 


‘ [2m Ex | JBmE, 


h ? 


we óbtain the. general solutions. 


X (x) = Axsin(kx) + Bycos(kyx), | Y (y) = Aysin(ky y) + By cos(ky y), 
Z(z) = Az sin(kzz) + B; cos(k,Z). 


The boundary conditions require that X(0) = Y(0) = Z(0) = 0, so By = By = 
B; — 0, and X (x) = Y (ly) = Z(lz) = 0, so that 


kile = nsf, kyy-nym, kd n, [5.36] 
where each n is a positive integer: 
mah ray Nyse ayaa LA wes [5.37] 


The (normalized) wave functions are 


| | 8 , (nye \ . (nyt fnn ) ees 
SE ae Cove US x | sin | ——y | sir z), 5.38 
Vn cn yn; Lll; sin ( ly x) sın ( Ly ») sin ( lz z [ ] 
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and the allowed energies are 


Ren? [n2 n É\ PE oe 
I (3 zm e Ou [5.39] 


S qo ud 
2m (BY m " 4 2m ' 


Enynyn:z = 


where k is the magnitude of the wave vector, k = (kx, ky, kz). 

If you imagine a three-dimensional space, with axes ky, ky, kz, and planes 
drawn in at ky = (m/l), (20 /lx), (31/1), ..., at ky = Gr/1y), Qx/1y), (32/1), 
S, and at kz = (w/lz), (2m/1,), Ga /Iz),...., each intersection potnt represents 


FIGURE 5.3:  Freé electron gas. Each intersection on the grid represents a stationary 
state. Shading indicates one “block”; there is one state for every block. 
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a distinct (one-particle) stationary state (Figure 5.3). Each block in this grid, and 
hence also each state, occupies a volume 


z? x? 
eT 5.40 
khl y pon 


3 


of “k-space,” where V = bbb is the volume of the object itself. Suppose our 
sample contains N atoms, and each atom contributes q free electrons. (In prac- 
tice, N will be enormous—on the order of Avogadro's number, for an object 
of macroscopic size— whereas q is a small number— 1 or 2, typically.) If elec- 
trons were bosons (or distinguishable particles), they would all settle down to 
the ground state, i/1j;. ^ But electrons are in fact identical fermions, subject 
to the Pauli exclusion principle, so only two of them can occupy any given 
state. They will fill up one octant of a sphere in k-space,!^ whose radius, kr, 
is determined by the fact that each pair of electrons requires a volume m3/V. 


(Equation 5.40): 
1 eU P 
P ri 2d LEM 
8 X3 J 2 F, 


Thus | 
kp = Bon), [5.41] 
where N 
= Y [5.42] 


is the free electron density (the number of free electrons per unit volume). 

The boundary separating occupied and unoccupied states, in k-space, is called 
the Fermi surface (hence the subscript F). The corresponding energy is called the. 
Fermi energy, Er; for a free electron gas, | 


hê e l 
Er = z OPTY. [5.43] 


The total energy of the electron gas can be calculated as follows: A shell of 
thickness dk (Figure 5.4) contains a volume 


1 
gn k?) dk, 


Im assuming there is no appreciable thermal excitation, or other disturbance, to lift the solid 
ont of its collective ground state. If you. like, I'm talking about a “cold” solid, though (as you will show 
in Problem 5.16(c)), typical solids arc still "cold," in this Sense, far above room temperature. 

l^Because N is such a huge number, we need not worry about the distinction between the actual 
jagged edge of the grid and the smooth spherical surface. that approximates it. 
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ke 


. FIGURE 5.4: One octant of a spherical shell in k-space. 


so the number of electron states in the shell is 


2[((1/2)x k* dk] _ V 


zt dk. 
(x3/V) 
Each of these states carries an energy A?k7/2m (Equation 5.39), so the energy of 
the shell is a 
htk V 
dE = —— —;k! dk, [5.44] 
2m n? 


and hence the total energy is 


[5.45] 


n? V l| "E WV . Ran Ne ye 
0 102m — 10x2m 


This quantum mechanical energy plays a role rather analogous to the internal 
thermal energy (U) of an ordinary gas. In particular, it exerts a pressure on the 
walls, for if the box expands by an amount dV, the total energy decreases: 

2h? (3n?Nqy ^ 2. dV 
CAU yates gu 


dE e TG 
i 3  lO0z?m | gy? 


and this shows up as work done on the outside (dW = P dV) by the quantum 
pressure. P. Evidently 


2E« 2 Wk. Gn M^ ss 5/3 


= _— 5.46 
3 4-3 doa 5m l 
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Here, then, is a partial answer to the question of why a cold solid object doesn't 
simply collapse: There is a stabilizing internal pressure, having nothing to do with 
eléctron-electron repulsion (which we have ignored) or thermal motion (which we 
have excluded), but is strictly quantum mechanical, and derives ultimately from 
the antisymmetrization requirement for the wave functions of identical fermions. It 
is sometimes called degeneracy pressure, though “exclusion pressure" might be.a 
better term. P 


Problem 5.15 Find the. average energy per free electron (Etot/ Nq), as a fraction 
of the Fermi energy. Answer: (3/5) Er. 


Problem 5.16 The density of copper is 8.96 gm/cm?, and its atomic weight is 
63.5 gm/mole. 


(a) Calculate the Fermi energy for copper (Equation 5.43). Assume g = 1, and 
give your answer in electron volts. 


(b) What is the corresponding electron velocity? Hint: Set Er — (1 /2)mv. Is 
‘it safe to assume the electrons in copper are nonrelativistic? 


(c) At what temperature would the characteristic thermal energy (kp T, where kg 
is the Boltzmann constant and T is the Kelvin temperature) equal the Fermi 
energy, for copper? Comment: This is called the Fermi temperature. As 
long asthe. actual. temperature is substantially below the Fermi temperature, 
the material can be regarded. as “cold,” with most of the electrons in the 
lowest accessible state. Since the melting point. of copper is 1356 K, solid 
copper is always cold. 


(d) Calculate the degeneracy pressure (Equation 5.46) of copper, in the electron 
gas model. 


Problem 5.17 The bulk modulus of a substance is the ratio of a small decrease 
in pressure to the resulting fractional increase in volume: 


dP 

Bee. 

dV 
Show that B = (5/3)P, in the free electron gas model, and use your result. in 
Problem 5.16(d) to estimate the bulk modulus of copper. Comment: The observed 
value is 13.4 x 101? N/m?, but don't expect perfect agreement—after all, we're 


1^We derived Equations 5.41, 5:43, 5.45, and 5.46 for thé special case of an infinite rectangular 
well, but they hold for containers of any ‘shape, as: long as. the number of particles is extremely large. 


E 
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neglecting all electron-nucleus and electron-electron forces! Actually, it is rather 
surprising that this calculation comes as close as it does. 


5.3.2 Band Structure. 


We're now going to improve on the free electron model by including the forces 
exerted on the electrons by the regularly spaced, positively charged, essentially 
stationary nuclei. The qualitative behavior of solids is dictated to a remarkable. 
degree by the mere fact that this potential is periodic —its actual shape is relevant 
only to the finer details. To show you how it goes, I’m going to develop the simplest 
possible model: a one-dimensional Dirac comb, consisting of evenly spaced delta 
function spikes (Figure 5.5).16 But first I need to introduce à powerful theorem that 
vastly simplifies the analysis of periodic potentials. 
A periodic potential is one that repeats itself after some. fixed distance a: 


V(x + a) = V (x). [5.47] 

Bloch's theorem tells us that for such a. potential the solutions to the Schrödinger 
equation, | 
Rh? d? 
2m dx 


can be taken to satisfy the condition 
V +a) =e (x), [5.49] 


for some constant K (by "constant" I mean that it is independent of x; it may well 
depend on E). 


V(x) 


—2a -a 0 a 2a 3a 4a 5a 6a 


FIGURE 5.5: The Dirac’ comb, Equation 5.57. 


16Tt would be more natural to let the delta functions go down, so as to represent the attractive force 
of the nuclei. But then there would be negative energy solutions as well as positive energy solutions, 
and that makes the calculations more-cumbersome (see Problem 5.20). Since all. we're trying to do here 
is explore the consequences of periodicity, it is simpler to adopt this less plausible shape; if it comforts 
you, think. of the nuclei-as residing at ta/2, 1:3a/2, t 5a/2, .... 
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Proof: Let D be the “displacement” operator: 
Df (x) = f(x +a). [5.50] 
For a periodic potential (Equation 5.47), D commutes with the Hamiltonian: 
[D, H] — 0, [5.51] 


and hence we are free to choose eigenfunctions of H that arë simultaneously 
eigenfunctions of D: Dy = Ay, or 


p+ a) = Xy (x). [5.52] 


Now, A is certainly not zero (if it were, then—since Equation 5.52 holds for 
all x —we would immediately obtain Y (x) = 0, which is not a permissible 
eigenfunction); like any nonzero complex number, it can be expressed às an 
exponential: 


A= elke [5.53] 


for some constant K. QED 


At this stage Equation 5.53 is just a strange way to write the eigenvalue i, 
but in a moment we will discover that K is in fact real, so that. although y(x) 
itself is not periodic, |y(x)/ is: 


Weta)? = oor. [5.54] 
as one would certainly expect." 

Of course, no real solid goes on forever, and the edges are going to spoil 
the periodicity of V (x), and render Bloch's theorem inapplicable. However, for 
any macroscopic crystal, containing something on the order of Avogadro’s number 
of atoms, it is hardly imaginable that edge effects can significantly influence the 
behavior of electrons deep inside. This suggests the following dévice to salvage 
Bloch's theorem: We wrap the x-axis around in a circle, and connect it onto its 
tail, after a large number N ~ 107? of periods; formally, we impose the boundary 
condition | 

VG 4- Na) = ya). [5.55] 


It follows (from Equation 5.49) that 


eiNKay (x) = yix), 


'7Indeed, you might be tempted to reverse the argument, starting with Equation 5,54, as a way 
of proving Bloch’s theorem. It doesn’t. work, for Equation 5.54 alone would allow the phase factor in 
Equation 5.49 to be a function of x. 


a 
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so eiNKa — 1, or NKa = 2z n, and hence. 
K=—, (neo tL £32... [5.56] 


In particular, for this arrangement K is necessarily real. The virtue of Bloch's 
theorem is that we need only solve the Schródinger Equation within a single cell 
(say, on the interval 0 < x <a); recursive application of Equation 5.49 generates 
the solution everywhere else. 
Now, suppose the potential consists of a long string of delta-function spikes 
(the Dirac comb): 
N-1 
Væ) =a Y de — ja). [5.57] 


j=0 


(In Figure 5.5 you must imagine that the x-axis has been “wrapped. around,” so 
the Nth spike actually appears at x = —a.) No one would pretend that this is a 
realistic model, but remember, it is only the effect of periodicity that concerns us 
here; the classic. study! used a repeating rectangular pattern, and many authors 
still prefer that one.? In the region 0 < x <a. the potential is zero, so 


| duy " 
2m dx? Lin 
Or ; 
ay " 
— —k^w, 
dx? v, 
where 
mE 
ER IM [5.58] 
as usual. 
The general solution is 
V (x) = Asin(kx) + Bcos(kx), (0 «x <a), [5.59] 


According to Bloch's theorem, the wave function in the cell immediately to the 
left of the origin is 


w(x) = eKA sink(x +a) + Beosk(x+a)], (—a-«x-«0). [5.60] 


ISR. de L. Kronig and W. G. Penney, Proc. R. Soc. Lond., ser. A, 130, 499 (1930). 
I2See, for instance, D. Park, Introduction to the Quantum Theory, 3td ed., MeGraw-Hill, New 
York (1992). 
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At x = 0, y; must be continuous, so. 
B =e '*@[A sin(ka) + B cos(ka)]; [5.61] 
its dérivative suffers a discontinuity proportional to the strength of the delta function 


(Equation 2.125, with the sign of œ switched, since these. are spikes instead of 
wells): 


kA — eE KIA cos(ka) — B sin(ka)] = uos [5.62] 


Solving Equation 5.61 for A sin(ka) yields 
A sin(ka) = [e^ — cos(ka)]B. [5.63] 


Substituting this into Equation 5.62, and cancelling KB, wé find 


[F^ — cos(ka)] [1 — e ?** cos(ka)] + e 5&^ sin? (ka) = Ps sin(ka), 
which simplifies to 
: mo .. l l 
cos(Ka) = cos(ka) + 3; 650. [5.64] 


‘This is the fundamental result, from which all else follows. For the Kronig-Penney 
potential (see footnote 18), the formula is more complicated, but it shares the 
qualitative features we. are about to explore. 

Equation 5.64 determines the possible values of k, arid hence the allowed 
energies. To simplify the notation, let 


z=ka, aid p= [5.65] 


so the right side of Equation 5.64 can be written as 


f(z) =cos(z) + 8 ae [5.66] 


The constant B is a dimensionless measure of the "strength" of the delta function. 
In Figure 5.6 I have plotted f(z), for the case 6 — 10. The important thing to 
notice is that f(z) strays outsidé the range (—1, 4-1), and in such regions there is 
no hope of solving Equation 5.64, since | cos(Ka)|, of course, cannot be greater 
than 1. These gaps represent forbidden energies; they are separated by bands 
of allowed energies. Within a given band, virtually any energy is allowed, since 
according to Equation 5.56 Ka = 2x n/N; where N is a huge number, and 5 can 
be. any integer. You might imagine drawing N horizontal lines on Figure 5.6, at 
values of cos(2ztn/ N) ranging from +1 (n = 0) down to —1 (n = N/2), and back 
almost to +1 (n = N — 1)—at this point the Bloch factor e'*@ recycles, so no. 
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FIGURE 5.6: Graph of f(z) (Equation 5.66) for 8 = 10, showing allowed bands 
(shaded) separated by forbidden gaps (where |f (z)| > 1). 


gap 


band 
FIGURE 5.7: The allowed energies for a periodic 
potential form essentially continuous bands. 


new solutions are generated by further increasing n. The intersection of each of 
these lines with f(z) yields an allowed energy. Evidently there are N states in each 
band, so closely spaced that for most purposes we can regard them as forming a 
continuum (Figure 5.7). 

So far, we've only put one electron in our potential. In practice there will be 
Nq of them, where q is again the number of "free" electrons per atom. Because of 
the Pauli exclusion principle, only two electrons can occupy a given spatial state, 
so if q = 1, they will half fill the first band, if g = 2 they will completely fill the 
first band, if q = 3 they half fill the second band, and so on—in the ground state. 
(In three dimensions, and with more realistic potentials, the band structure may 
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be more complicated, but the existence of allowed bands, separated by forbidden 
gaps, persists—band structure is the signature of a periodic potential.) 

Now, if a band is entirely filled, it takes a relatively large energy to excite 
an electron, since it has to jump across the forbidden zone. Such materials will 
be electrical insulators. On the other hand, if a band is only partly filled, it takes 
very little energy to excite an electron, and such materials are typically conduc- 
tors. If you dope an insulator with a few atoms of larger or smaller q, this puts 
some “extra” electrons into the next higher band, or creates some holes in the 
previously filled one, allowing in either case for weak electric currents to flow; 
such materials are called semiconductors. In the free electron model all solids 
should be excellent conductors, since there are no large gaps in the spectrum of 
allowed energies. It takes the band theory to account for the extraordinary range 
of electrical conductivities exhibited by the solids in nature. 


Problem 5.18 


(a) Using Equations 5.59 and 5.63, show that the wave function for a particle in 
the periodic delta function potential can be written in the form 


w(x) = C[sinkx) e! K"^sink(a x). (0 <x xa). 
(Don't bother to determine the normalization constant C.) 


(b) There is an exception: At the top of a band, where z is an integer multiple 
of zx (Figure 5.6), (a) yields y(x) = 0. Find the correct wave function for 
this case. Note what happens to y at each delta function. 


Problem 5.19 Find the energy at the bottom of the first allowed band, for the 
case B — 10, correct to three significant digits. For the sake of argument, assume 
aja = 1 eV. 


x «Problem 5.20 Suppose we use delta function wells, instead of spikes (i.e., switch the 
sign of a in Equation 5.57). Analyze this case, constructing the analog to Figure 5.6. 
This requires no new calculation, for the positive energy solutions (except that B is 
now negative; use 8 = —1.5 for the graph), but you do need to work out the negative 
energy solutions (let k =  —2mE/h and z = —ka, for E < 0). How many states 
are there in the first allowed band? 


Problem 5.21 Show that most of the energies determined by Equation 5.64 are 
doubly degenerate. What are the exceptional cases? Hint: Try it for N = 1, 2, 3. 
4, ..., to see how it goes. What are the possible values of cos(Ka) in each case? 
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5.4 QUANTUM STATISTICAL MECHANICS 


At absolute zero, a physical system occupies its lowest allowed energy configura- 
tion. As we turn up the temperature, random thermal activity will begin to populate 
the excited states, and this raises the following question: If we have a large number 
N of particles, in thermal equilibrium at temperature T, what is the probability that 
a particle, selected at random, would be found to have the specific energy, E;? 
Note that the "probability" in question has nothing to do with quantum indeter- 
minacy —exactly the same question arises in classical statistical mechanics. The 
reason we must be content with a probabilistic answer is that we are typically 
dealing with enormous numbers of particles, and we could not possibly expect 
to keep track of each one separately, whether or nor the underlying mechanics is 
deterministic. 

The fundamental assumption of statistical mechanics is that in thermal 
equilibrium every distinct state with the same total energy, E, is equally proba- 
ble. Random thermal motions constantly shift energy from one particle to another, 
and from one form (rotational, kinetic, vibrational, etc.) to another, but (absent 
external influences) the tota! is fixed by conservation of energy. The assumption 
(and it's a deep one, worth thinking about) is that this continual redistribution of 
energy does not favor any particular state. The temperature, 7, is simply a mea- 
sure of the total energy of a system in thermal equilibrium. The only new twist 
introduced by quantum mechanics has to do with how we count the distinct states 
(it's actually easier than in the classical theory, because the states are generally 
discrete), and this depends critically on whether the particles involved are dis- 
tinguishable, identical bosons, or identical fermions. The arguments are relatively 
straightforward, but the arithmetic gets pretty dense, so I’m going to begin with 
an absurdly simple example, so you'll have a clear sense of what is at issue when 
we come to the general case. 


5.4.1 An Example 


Suppose we have just three noninteracting particles (all of mass m) in the one- 
dimensional infinite square well (Section 2.2). The total energy is 


2+2 


mh 
E=E, + Ep + Ec = (ng +3 +10) [5.67] 


(see Equation 2.27), where n4, ng, and nc are positive integers. Now suppose, for 
the sake of argument, that E = 363(22h7/2ma?), which is to say, 


nÀ +n} ne = 363. [5.68] 


There are, as it happens, 13 combinations of three positive integers, the sum 
of whose squares is 363: All three could be 11, two could be 13 and one 5 
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(which occurs in three permutations), one could be 19 and two | (again, three 
permutations), or one could be 17, one 7, and one 5 (six permutations). Thus 
(nA. np, Nc) is one of the following: 


(11, 11, 11). 
(13, 13,5). (13,5, 13), (5, 13, 13). 
(1, 1. 19), (1.19.1), (19, 1. 1). 
(5,7. 17), (5, 17,7). (7,5. 17). (7.17.5). (17, 5,7), (de 5). 


If the particles are distinguishable, each of these represents a distinct quantum 
state, and the fundamental assumption of statistical mechanics says that in thermal 
equilibrium? they are all equally likely. But I’m not interested in knowing which 
particle is in which (one-particle) state, only the total number of particles in each 
state—the occupation number, N,, for the state y,,. The collection of all occu- 
pation numbers for a given 3-particle state we will call the configuration. If all 
three are in yj, the configuration is 


(0. 0, 0, 0. 0, 0. 0. 0. 0.0, 3. 0, 0,0. 0.0, 0. 0... .). [5.69] 

(i.e., Nj; = 3, all others zero). If two are in ya and one is in ys, the configura- 
tion is 

(0.0.0.0, 1.0,0.0.0,0,0. 0. 2, 0,0. 0.0. ...). [5.70] 


(i.e, N5 = 1, Ni3 = 2, all others zero). If two are in y; and one is in yi9, the 
configuration is 


(2, 0, 0, 0. 0, 0, 0. 0, 0.0. 0, 0. 0, 0,0, 0. 0.0. 1. 0. ...). [5.71] 


i.e., Nj = 2, Nig = 1, all others zero). And if there is one particle in ws, one in 
V7, and one in v,7, the configuration is 


(0. 0. 0, 0. 1. 0. 1.0.0.0, 0,0. 0.0. 0,0. 1. 0. 0. ...). [5.72] 


(i.e., Ns = N7 = Ny; = 1, all others zero.) Of these, the last is the most probable 
configuration, because it can be achieved in six different ways, whereas the middle 
two occur three ways, and the first only one. 


WHow the particles maintain thermal equilibrium. if they really don't interact at all. is a problem 
I'd rather not worry about — maybe God reaches in periodically and stirs things up (being careful not 
to add or remove any energy). In real life, of course, the continual redistribution of energy is caused 
precisely by interactions between the particles. so if you don't approve of divine intervention let there 
be extremely weak interactions— sufficient to thermalize the system (at least. over long time periods). 
but too small to alter the stationary states and the allowed energies appreciably. 
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Returning now to my original question, if we select one of these three particles 
at random, what is the probability (P„) of getting a specific (allowed) energy En? 
The only way you can get E; is if it’s in the third configuration (Equation 5.71); the 
chances of the system being in that configuration are 3 in 13, and in that config- 
uration the probability of getting E; is 2/3, so P, = (3/13) x (2/3) = 2/13. 
You could get Es either from configuration 2 (Equation 5.70)—chances 3 in 
13—with probability 1/3, or from configuration 4 (Equation 5.72)— chances 6 
in 13—with probability 1/3, so Ps = (3/13) x (1/3) + (6/13) x (1/3) = 3/13. 
You can only get E; from configuration 4: P; = (6/13) x (1/3) = 2/13. Like- 
wise, Ej, comes only from the first configuration (Equation 5.69)— chances 1 in 
13— with probability 1: Pj; = (1/13). Similarly, Pi} = (3/13) x (2/3) = 2/13, 
Pj; = (6/13) x (1/3) = 2/13, and Pig = (3/13) x (1/3) = 1/13. As a check, 
note that 


Nos RO PIER ARERSS QE qq p Qc ed 
ITesT TPT Berry = ata BIB tpt Bt Bt 


That’s when the particles are distinguishable. If in fact they are identical 
fermions, the antisymmetrization requirement (leaving aside spin, for simplicity — or 
assuming they are all in the same spin state, if you prefer) excludes the first three 
configurations (which assign two—or, worse still, three— particles to the same 
state), and there is just one state in the fourth configuration (see Problem 5.22(a)). 
For identical fermions, then, Ps = P; = Pj; = 1/3 (and again the sum of the 
probabilities is 1). On the other hand, if they are identical bosons the symmetriza- 
tion requirement allows for one state in each configuration (see Problem 5.22(b)), 
so Pi = (1/4) x (2/3) = 1/6, Ps = (1/4) x (1/3) + (1/4) x (1/3) = 1/6, 
P; = (1/4) x (1/3) = 1/12, Pi = (1/4) x (1) = 1/4, Pi3 = (0/4) x (2/3) = 1/6, 
Py? = (1/4) x (1/3) = 1/12, and Pio = (1/4) x (1/3) = 1/12. As always, the 
sum is 1. 

The purpose of this example was to show you how the counting of states 
depends on the nature of the particles. In one respect it was actually more compli- 
cated than the realistic situation, in which N is a huge number. For as N grows, 
the most probable configuration (in this example, Ns = N7 = Nj; = 1, for the case 
of distinguishable particles) becomes overwhelmingly more likely than its competi- 
tors, so that, for statistical purposes, we can afford to ignore the others altogether:?! 
The distribution of individual particle energies, at equilibrium, is simply their dis- 
tribution in the most probable configuration. (If this were true for N = 3— which, 
obviously, it is not —we would conclude that P5 = P; = Pj; = 1/3 for the case 
of distinguishable particles.) l'll return to this point in Section 5.4.3, but first we 
need to generalize the counting procedure itself. 


2! This is an astonishing and counterintuitive fact about the statistics of large numbers. For a good 
discussion see Ralph Baierlein, Thermal Physics. Cambridge U.P. (1999). Section 2.1. 
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*Problem 5.22 


(a) Construct the completely antisymmetric wave function y (x4. xp, xc) for 
three identical fermions, one in the state y5, one in the state w7, and one in 
the state V7. 


(b) Construct the completely symmetric wave function y (x4. xg. xc) for three 
identical bosons, (i) if all three are in state Vj, (ii) if two are in state Vj 
and one is in state 19, and (iii) if one is in the state ws, one in the state v7, 
and one in the state W17. 


xProblem 5.23 Suppose you had three (noninteracting) particles, in thermal equi- 
librium, in a one-dimensional harmonic oscillator potential, with a total energy 
E = (9/2)hw. 


(a) If they are distinguishable particles (but all with the same mass), what are 
the possible occupation-number configurations, and how many distinct (three- 
particle) states are there for each one? What is the most probable configura- 
tion? If you picked a particle at random and measured its energy, what values 
might you get, and what is the probability of each one? What is the most 
probable energy? 


(b) Do the same for the case of identical fermions (ignoring spin, as we did in 
Section 5.4.1). 


(c) Do the same for the case of identical bosons (ignoring spin). 


5.4.2 The General Case 


Now consider an arbitrary potential, for which the one-particle energies are E1, E», 
E3,..., with degeneracies dj. d2, d3.... (ie. there are d, distinct one-particle 
states with energy Ej). Suppose we put N particles (all with the same mass) into 
this potential; we are interested in the configuration (Nj. No. N3. ...), for which 
there are N, particles with energy Ej, N2 particles with energy E», and so on. 
Question: How many different ways can this be achieved (or, more precisely, 
how many distinct states correspond to this particular configuration)? The answer, 
Q(N,, N35. Ns. ...), depends on whether the particles are distinguishable, identical 
fermions, or identical bosons, so we'll treat the three cases separately.?? 

First, assume the particles are distinguishable. How many ways are there to 
select (from the N available candidates) the N to be placed in the first “bin”? 


~The presentation here follows closely that of Amnon Yariv, An Introduction to Theory and 
Applications of Quantum Mechanics. Wiley. New York (1982). 
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Answer: the binomial coefficient, “N choose Nj," 


NY. N! 
(5) C NKN- NDV [5.73] 


For there are N ways to pick the first particle, leaving (N — 1) for the second, and 


so on: 


N! 
ONE AE am a Ls ENSE 


However, this counts separately the N;! different permutations of the N, particles, 
whereas we don’t care whether number 37 was picked on the first draw, or on 
the 29th draw; so we divide by N;!, confirming Equation 5.73. Now, how many 
different ways can those N, particles be arranged within the first bin? Well, there 
are d, states in the bin, so each particle has dı choices; evidently there are (dj)! 
possibilities in all. Thus the number of ways to put Nj particles, selected from a 
total population of N, into a bin containing d, distinct options, is 


N'd 
Ni (N — ND 


The same goes for bin 2, of course, except that there are now only (N — Nj) 
particles left to work with: 


(N -Npa 
No'!(N — N; — No)! 


and so on. It follows that 


O(N. N2. Na. ...) 


| Nd (N — N)!d$? (N — Ny — No)!d3° 

— NINN- Ni)! NNN — Ny — No)! NN — Ni — N2 — N3)! 
d" ghz c 99 a 

= N! L2 a NI LN 5.74 
N,!N2!N3!... I] AN l l 


(You should pause right now and check this result, for the example in Section 5.4.1 — 
see Problem 5.24.) 

The problem is a lot easier for identical fermions. Because they are indis- 
tinguishable, it doesn’t matter which particles are in which states—the antisym- 
metrization requirement means that there is just one N-particle state in which a 
specific set of one-particle states are occupied. Moreover, only one particle can 
occupy any given state. There are 

(i) 
Nn 


Section 5.4: Quantum Statistical Mechanics 235 


ways to choose the N, occupied states in the nth bin,?* so 


oo 


Q(Ni. No». N3....) = aaa, rt 


n=] 


[5.75] 


(Check it for the example in Section 5.4.1—see Problem 5.24.) 

The calculation is hardest for the case of identical bosons. Again, the sym- 
metrization requirement means that there is just one N-particle state in which a 
specific set of one-particle states are occupied, but this time there is no restriction 
on the number of particles that can share the same one-particle state. For the nth 
bin, the question becomes: How many different ways can we assign N, identical 
particles to d, different slots? There are many tricks to solve this combinatorial 
problem; an especially clever method is as follows: Let dots represent particles and 
crosses represent partitions, so that, for example, if dp = 5 and N, = 7, 


ee x exeo eve x e x 


would indicate that there are two particles in the first state, one in the second, three 
in the third, one in the fourth, and none in the fifth. Note that there are N,, dots, 
and (d, — 1) crosses (partitioning the dots into d, groups). If the individual dots 
and crosses were labeled, there would be (N, + d, — 1)! different ways to arrange 
them. But for our purposes the dots are all equivalent — permuting them (N,,! ways) 
does not change the state. Likewise, the crosses are all equivalent —permuting them 
((d, — 1)! ways) changes nothing. So there are in fact 


(Nn o d, — 1)! = n +d, — ) 
Ny (dj xs 1)! Nn 


distinct ways of assigning the N, particles to the dj, one-particle states in the nth 
bin, and we conclude that 


[5.76] 


oo 
(Nn + d, = 1)! 
Ni, N2, Na...) = c 5.77 
Q(N1. N2, Ns. ...) I N, (d, — Di [5.77] 


(Check it for the Example in Section 5.4.1—see Problem 5.24.) 


*Problem 5.24 Check Equations 5.74, 5.75, and 5.77, for the example in Sec- 
tion 5.4.1. 


x xProblem 5.25 Obtain Equation 5.76 by induction. The combinatorial question is 
this: How many different ways can you put N identical balls into d baskets (never 


23This should be zero. of course, if Ny > dy. and it is, provided we consider the factorial of a 
negative integer to be infinite. 
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mind the subscript n for this problem). You could stick all N of them into the third 
basket, or all but one in the second basket and one in the fifth, or two in the first 
and three in the third and all the rest in the seventh, etc. Work it out explicitly for 
the cases N = 1, N = 2, N = 3, and N = 4; by that stage you should be able to 
deduce the general formula. 


5.4.3 The Most Probable Configuration 


In thermal equilibrium, every state with a given total energy E and a given particle 
number N is equally likely. So the most probable configuration (Nj, N2. Na, ...) 
is the one that can be achieved in the largest number of different ways—-it is that 
particular configuration for which Q(Nj. N2, Na. ...) is a maximum, subject to 
the constraints 


oo 
> M =N, [5.78] 
n=! 
and 
oo 


N, En = E. [5.79] 


The problem of maximizing a function F (xj. x2, xa. ...) of several variables, 
subject to the constraints fj (x), X2. X3. ...) = 0, fa(xi, X2, X3. ...) = 0, etc., is 
most conveniently handled by the method of Lagrange multipliers.2^ We introduce 
the new function 


GOL X». X3... Àj, À2.... ) BF+A fi afa, [5.80] 


and set all its derivatives equal to zero: 


dG dG 
=0; —=0 
Xn dAn 


[5.81] 


In our case it's a little easier to work with the logarithm of Q, instead of 
Q itself—this turns the products into sums. Since the logarithm is a monotonic 
function of its argument, the maxima of Q and In(Q) occur at the same point. So 


we let 
G00) es w-Ym |o EY. [5.82] 


n=} n=l 


4 See, for example, Mary Boas. Mathematical Methods in the Physical Sciences, 2nd ed.. Wiley. 
New York (1983), Chapter 4, Section 9. 
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where œ and P are the Lagrange multipliers. Setting the derivatives with respect 
to œ and f equal to zero merely reproduces the constraints (Equations 5.78 and 
5.79); it remains, then, to set the derivative with respect to N, equal to zero. 

If the particles are distinguishable, then Q is given by Equation 5.74, and we 
have 


G =In(N!) + Y ^ [N, In(d;) — In(N,!)] 


n=l 


[5.83] 
oo co 
+a N- Nn + B E- 3 NE, 
n=! n=l 


Assuming the relevant occupation numbers (N,,) are large, we can invoke Stirling’s 
approximation:? 
ln(z!) ?zln(z —z forz> 1, [5.84] 


to write 


oo 
Gg 5 [Nn In(dy) — Ny In(N4) + Nn — o N, — PEn Nn] 


n=] 


+In(N!)+aN + BE. [5.85] 

It follows that 3G 
—_—_ = In(d,) = In( N,) =.= B Es. [5.86] 

9 N, 


Setting this equal to zero, and solving for N,, we conclude that the most probable 
occupation numbers, for distinguishable particles, are 


N, = dpe CtP En) , [5.87] 


If the particles are identical fermions, then Q is given by Equation 5.75, and 
we have 
oo 


G —  (In(d;) — n(n!) — In[(d, — N,)!]] 


n=l 


[5.88] 
oo oo 
+a N — 3 Nn + B E — $ NE, 


n=l n-l 


P Stirling's approximation can be improved by including more terms in the Stirling series, but 
the first two will suffice for our purposes. See George Arfken and Hans-Jurgen Weber. Mathematical 
Methods for Physicists, 5th ed.. Academic Press, Orlando (2000). Section 10.3. If the relevant occupation 
numbers are not large—as in Section 5.4.1 —then statistical mechanics simply doesn't apply. The whole 
point is to deal with such enormous numbers that statistical inference is a reliable predictor. Of course. 
there will always be one-particle states of extremely high energy that are not populated at all: fortunately. 
Stirling’s approximation holds also for z = 0. I use the word "relevant" to exclude any stray states right 
at the margin, for which N,, is neither huge nor zero. 
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This time we must assume not only that N, is large, but also that d, >> Nn 78 so 
that Stirling's approximation applies to both terms. In that case 


oo 
GX P [ina D — Ny In(Nn) + Nn — (dn — Nn) In(dn — Nn) 
n=] [5.89] 


+ (di — Nn) - aN, — BEN, | +aN + BE, 


SO 
dG 


L— = —In(Nn) + In(dy — Ny) — à — BE,. [5.90] 
0 N, 


Setting this equal to zero, and solving for N,,, we find the most probable occupation 
numbers for identical fermions: 


dy 


Nn = e(&+BEn) + ] i 


[5.91] 


Finally, if the particles are identical bosons, then Q is given by Equation 5.77, 
and we have 


G — {In[(Nn + d, — D!] — (Nr!) — In[(ds — 1)9) 


n=l 


[5.92] 
oo oo 
+a L = »] 4B E = Y 
n=l n=l 
Assuming (as always) that N,, >> 1, and using Stirling's approximation: 
oo 
G ~ 3 (Ns T dn T 1) In(N, t d, = 1) = (Nn + dn = 1) s Nn In(N,) 
n=] [5.93] 
+ Nn —In[(dn — 1)!] - o Nn — BEn Ny} +aN + BE. 
SO 
dG 
T =In(N, +d, — 1) — In(N,,) — o — BE,. [5.94] 


ONn 


*61n one dimension the energies are nondegenerate (see Problem 2.45), but in three dimensions 


dy typically increases rapidly with increasing n (for example, in the case of hydrogen, dn = n°). So 
it is not unreasonable to assume that for most of the occupied states dj, >> 1. On the other hand. 
dy is certainly not much greater than N, at absolute zero, where all states up to the Fermi level are 
filled. and hence d, = Nj. Here again we are rescued by the fact that Stirling's formula holds also for 
z —Q. 
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Setting this equal to zero, and solving for N,, we find the most probable occupation 
numbers for identical bosons: 


d, —1 


Nn = e@+BEn) —] = * 


[5.95] 
(For consistency with the approximation already invoked in the case of fermions, 
we should really drop the 1 in the numerator, and I shall do so from now on.) 


Problem 5.26 Use the method of Lagrange multipliers to find the rectangle of 
largest area, with sides parallel to the axes, that can be inscribed in the ellipse 
(x /a)? + (y/b)? = 1. What is that maximum area? 


Problem 5.27 


(a) Find the percent error in Stirling's approximation for z — 10. 


(b) What is the smallest integer z such that the error is less than 196? 


5.4.4 Physical Significance of « and f 


The parameters a and 6 came into the story as Lagrange multipliers, associated 
with the total number of particles and the total energy, respectively. Mathemati- 
cally, they are determined by substituting the occupation numbers (Equations 5.87, 
5.91, and 5.95) back into the constraints (Equations 5.78 and 5.79). To carry out 
the summation, however, we need to know the allowed energies (£,), and their 
degeneracies (dj), for the potential in question. As an example, I'll work out the 
case of an ideal gas—a large number of noninteracting particles, all with the same 
mass, in the three dimensional infinite square well. This will motivate the physical 
interpretation of o and f. 
In Section 5.3.1 we found the allowed energies (Equation 5.39): 


R 
h^» 


2m ` 


Ty TNA y An- 
k= E x . 
( bo ly kh ) 


As before, we convert the sum into an integral, treating k as a continuous variable, 
with one state (or, for spin s, 2s + 1 states) per volume z?/V of k-space. Taking 


Ex [5.96] 


where 
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as our “bins” the spherical shells in the first octant (see Figure 5.4), the “degener- 
acy” (that is, the number of states in the bin) is 


| l4xkdk V 
| 8(G)/V) 2x? 
For distinguishable particles (Equation 5.87), the first constraint (Equation 5.78) 


k? dk. [5.97] 


k 


becomes 
3/2 
N= sind rl g PIC /2m 42 dk = Ve~% ( m ;) / 
2n^ — Jo 2n Bh? 
SO " 
N [2nBh? V 
e =p (ER [5.98] 


The second constraint (Equation 5.79) says 


2 


2 3/2 
= V pat T BREK? am pA dk = 3V -a ( m ) / . 
2n2 2m 0 2p 2n Bh? 


or, putting in Equation 5.98 for e ^: 

_3N 
26` 

(If you include the spin factor, 2s + 1, in Equation 5.97, it cancels out at this point, 

so Equation 5.99 is correct for all spins.) 


This result (Equation 5.99) is reminiscent of the classical formula for the 
average kinetic energy of an atom at temperature T et 


E [5.99] 


3 
—= 5 kal. [5.100] 


where kp is the Boltzmann constant. This suggests that 6 is related to the temper- 


ature: | 


To prove that this holds in general, and not simply for distinguishable particles 
in the three-dimensional infinite square well, we would have to demonstrate that 
different substances in thermal equilibrium with one another have the same value of 
f. The argument is sketched in many books,*® but I shall not reproduce it here—1 
will simply adopt Equation 5.101 as the definition of T. 


B [5.101] 


=7See. for example. David Halliday, Robert Resnick. and Jearl Walker. Fundamentals of Physics. 
5th ed.. Wiley. New York (1997). Section 20-5. 


?8See. for example. Yariv (footnote 22). Section 15.4. 
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It is customary to replace œ (which, as is clear from the special case of 
Equation 5.98, is a function of T) by the so-called chemical potential, 


u(T) = —akgT, [5.102] 


and rewrite Equations 5.87, 5.91, and 5.95 as formulas for the most probable 
number of particles in a particular (one-particle) state with energy € (to go from 
the number of particles with a given energy to the number of particles in a 
particular state with that energy, we simply divide by the degeneracy of the 
state): 


e 6-MkaT MAXWELL-BOLTZMANN 


l 
n(e) = Eae T1 FERMI-DIRAC [5.103] 


l 


EES BOSE-EINSTEIN 


The Maxwell-Boltzmann distribution is the classical result, for distinguishable 
particles; the Fermi-Dirac distribution applies to identical fermions, and the Bose- 
Einstein distribution is for identical bosons. 

The Fermi-Dirac distribution has a particularly simple behavior as T — 0: 


(e—u)/kgT 0. if e < i (0). 
E >] co, ife > (0), 


SO 
l. ife < u(0). 


0; -ikes a0). eee 


n(e) > | 


All states are filled, up to an energy u (0), and none are occupied for energies above 
this (Figure 5.8). Evidently the chemical potential at absolute zero is precisely the 
Fermi energy: 

u(0) = Ep. [5.105] 


As the temperature rises, the Fermi-Dirac distribution “softens” the cutoff, as indi- 
cated by the rounded curve in Figure 5.8. 

Returning now to the special case of an ideal gas, for distinguishable particles 
we found that the total energy at temperature T is (Equation 5.99) 


E= Š NAT. [5.106] 


242 Chapter § Identical Particles 


ne) 


Ec = u(0) e 


FIGURE 5.8: Fermi-Dirac distribution for T = 0 and for T somewhat above zero. 


while (from Equation 5.98) the chemical potential is 


N 3 Qn h? 
T) =kpT | In{ — =| . .107 
AO) B (2) | [5.107] 


I would /ike to work out the corresponding formulas for an ideal gas of identical 
fermions and bosons, using Equations 5.91 and 5.95 in place of Equation 5.87. The 
first constraint (Equation 5.78) becomes 


E ua k? 
md — Á— 7 5.108 
23? Í el(}7k-/2m)—-u}/kBT + 1 i | | 


(with the plus sign for fermions and minus for bosons), and the second constraint 
(Equation 5.79) reads 


LN s kA 
= z:x.d ne ieee [5.109] 
274 2m o el /2m-uy/ksT + 1 


The first of these determines u(T), and the second determines E(T) (from the 
latter we obtain, for instance, the heat capacity: C = 0E /dT). Unfortunately, these 
integrals cannot be evaluated in terms of elementary functions, and I shall leave it 
for you to explore the matter further (see Problems 5.28 and 5.29). 


Problem 5.28 Evaluate the integrals (Equations 5.108 and 5.109) for the case of 
identical fermions at absolute zero. Compare your results with Equations 5.43 and 
5.45. (Note that for electrons there is an extra factor of 2 in Equations 5.108 and 
5.109, to account for the spin degeneracy.) 
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* * xProblem 5.29 


(a) 


(b) 


(c) 


Show that for bosons the chemical potential must always be less than the 
minimum allowed energy. Hint: n(€) cannot be negative. 


In particular, for the ideal bose gas, w(T) < 0 for all T. Show that in this 
case uL (T) monotonically increases as T decreases, assuming N and V are 
held constant. Hint: Study Equation 5.108, with the minus sign. 


A crisis (called Bose condensation) occurs when (as we lower T) (T) hits 
zero. Evaluate the integral, for u = 0, and obtain the formula for the critical 
temperature 7, at which this happens. Below the critical temperature, the 
particles crowd into the ground state, and the calculational device of replacing 
the discrete sum (Equation 5.78) by a continuous integral (Equation 5.108) 
loses its validity.?? Hint: 


eo ys-l 

i dx =T(s)f(s), [5.110] 
0 e*—1 

where I" is Euler's gamma function and ¢ is the Riemann zeta function. 

Look up the appropriate numerical values. 


Find the critical temperature for ^He. Its density, at this temperature, is 
0.15 gm/cm?. Comment: The experimental value of the critical temperature 
in “He is 2.17 K. The remarkable properties of *He in the neighborhood of 
T, are discussed in the reference cited in footnote 29. 


5.4.5 The Blackbody Spectrum 


Photons (quanta of the electromagnetic field) are identical bosons with spin 1, but 


they 


are very special, because they are massless particles, and hence intrinsically 


relativistic. We can include them here, if you are prepared to accept four assertions 
that do not belong to nonrelativistic quantum mechanics: 


1. 


2. 


3. 


The energy of a photon is related to its frequency by the Planck formula, 
E — hv = ho. 


The wave number k is related to the frequency by k = 2z/A = w/c, where 
c is the speed of light. 


Only two spin states occur (the quantum number 7 can be +1 or —1, but 
not 0). 


29 See F. Mandl. Statistical Physics. Wiley. London (1971). Section 11.5. 
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4. The number of photons is not a conserved quantity; when the temperature 
rises, the number of photons (per unit volume) increases. 


In view of item 4, the first constraint (Equation 5.78) does not apply. We 
can take account of this by simply setting œ — 0, in Equation 5.82 and every- 
thing that follows. Thus the most probable occupation number, for photons, is 


(Equation 5.95): 
dk 


Nar eholkaT 5] 


[5.111] 


For free photons in a box of volume V, d; is given by Equation 5.97, multiplied 


by 2 for spin (item 3), and expressed in terms of w instead of k (item 2): 


V 
= — o do. [5.112] 
TC 


pias [5.113] 


z2c? (ehe kaT - 1) : 


This is Planck's famous formula for the blackbody spectrum, giving the energy 
per unit volume, per unit frequency, for an electromagnetic field in equilibrium at 
temperature T. It is plotted, for three different temperatures, in Figure 5.9. 


Problem 5.30 


(a) Use Equation 5.113 to determine the energy density in the wavelength range 
dA. Hint: Set p(w) dw = p(X) dA, and solve for p(A.). 


(b 


— 


Derive the Wien displacement law for the wavelength at which the black- 
body energy density is a maximum: 


2.90 x 107? mK 
ÀÁmax = ——————— —. [5.114] 
T 
Hint: You'll need to solve the transcendental equation (5 — x) = 5e", 
using a calculator or a computer; get the numerical answer accurate to three 
significant digits. 


J0In truth, we have no business using this formula. which came from the (nonrclativistic) 
Schródinger equation: fortunately, the degencracy is exactly the same for the relativistic case. See 
Problem 5.36. 
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~<— Visible region ———*»- 


p(«) [10715 J/m? Hz] 


2000 K 


Frequency [1014 Hz] 


FIGURE 5.9:  Planck's formula for the blackbody spectrum, Equation 5.113. 


Problem 5.31 Derive the Stefan-Boltzmann formula for the total energy density 
in blackbody radiation: 


E aky 4 xiu eari NA 
== (Sk) r = (7.57 x 10716 Jm?K jus [5.115] 


Hint: Use Equation 5.110 to evaluate the integral. Note that ¢ (4) = 2*/90. 


FURTHER PROBLEMS FOR CHAPTER 5 


Problem 5.32 Imagine two noninteracting particles, each of mass m, in the one- 
dimensional harmonic oscillator potential (Equation 2.43). If one is in the ground 
state, and the other is in the first excited state, calculate ((x, — x2)?), assuming 
(a) they are distinguishable particles, (b) they are identical bosons, and (c) they are 
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identical fermions. Ignore spin (if this bothers you, just assume they are both in 
the same spin state). 


Problem 5.33 Suppose you have three particles, and three distinct one-particle 
states (Va (x), Wp(x), and v.(x)) are available. How many different three-particle 
states can be constructed, (a) if they are distinguishable particles, (b) if they are 
identical bosons, (c) if they are identical fermions? (The particles need not be in 
different states—Wa(X1) Va (x2) Va (x3) would be one possibility, if the particles are 
distinguishable.) 


Problem 5.34 Calculate the Fermi energy for noninteracting electrons in a two- 
dimensional infinite square well. Let ø be the number of free electrons per unit 
area. 


x x xProblem 5.35 Certain cold stars (called white dwarfs) are stabilized against grav- 
itational collapse by the degeneracy pressure of their electrons (Equation 5.46). 
Assuming constant density, the radius R of such an object can be calculated as 
follows: 


(a) Write the total electron energy (Equation 5.45) in terms of the radius, the 
number of nucleons (protons and neutrons) N, the number of electrons per 
nucleon q, and the mass of the electron mm. 


(b) Look up. or calculate, the gravitational energy of a uniformly dense sphere. 
Express your answer in terms of G (the constant of universal gravitation), 
R, N, and M (the mass of a nucleon). Note that the gravitational energy is 
negative. 


(c) Find the radius for which the total energy, (a) plus (b), is a minimum. Answer: 


R- 230 hg! 
4 GmM?N!/3" 


(Note that the radius decreases as the total mass increases!) Put in the actual 
numbers, for everything except N, using q = 1/2 (actually, g decreases a bit 
as the atomic number increases, but this is close enough for our purposes). 
Answer: R = 7.6 x 10°N7!/3 m. 


= 


Determine the radius, in kilometers, of a white dwarf with the mass of the. 
sun. 


(e) Determine the Fermi energy, in electron volts, for the white dwarf in (d), 
and compare it with the rest energy of an electron. Note that this system is 
getting dangerously relativistic (see Problem 5.36). 
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x x «Problem 5.36 We can extend the theory of a free electron gas (Section 5.3.1) to 
the relativistic domain by replacing the classical kinetic energy, E — p?/2m, with 
the relativistic formula, E = y p?c? + m?c^ — mc?. Momentum is related to the 
wave vector in the usual way: p = Ak. In particular, in the extreme relativistic 
limit, E ~ pc = lick. 


(a) 


(b) 


Replace /i?k?/2m in Equation 5.44 by the ultra-relativistic expression, lick, 
and calculate Ej, in this regime. 


Repeat parts (a) and (b) of Problem 5.35 for the ultra-relativistic electron gas. 
Notice that in this case there is no stable minimum, regardless of R; if the 
total energy is positive, degeneracy forces exceed gravitational forces, and the 
star will expand, whereas if the total is negative, gravitational forces win out, 
and the star will collapse. Find the critical number of nucleons, Ne, such that 
gravitational collapse occurs for N > Ne. This is called the Chandrasekhar 
limit. Answer: 2.04 x 10?7. What is the corresponding stellar mass (give 
your answer as a multiple of the sun's mass). Stars heavier than this will not 
form white dwarfs, but collapse further, becoming (if conditions are right) 
neutron stars. 


At extremely high density, inverse beta decay, e^ + pt — n + v, converts 
virtually all of the protons and electrons into neutrons (liberating neutrinos, 
which carry off energy, in the process). Eventually neutron degeneracy pres- 
sure stabilizes the collapse, just as electron degeneracy does for the white 
dwarf (see Problem 5.35). Calculate the radius of a neutron star with the 
mass of the sun. Also calculate the (neutron) Fermi energy, and compare 
it to the rest energy of a neutron. Is it reasonable to treat a neutron star 
nonrelativistically? 


x x «Problem 5.37 


(a) 


Find the chemical potential and the total energy for distinguishable particles in 
the three dimensional harmonic oscillator potential (Problem 4.38). Hint: The 
sums in Equations 5.78 and 5.79 can be evaluated exactly, in this case— no 
need to use an integral approximation, as we did for the infinite square well. 
Note that by differentiating the geometric series, 


l oo 
ree ae = wey [5.1 16] 


n=0 


you can get 


d x = " 
acr = ont Dx 


n=l 
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and similar results for higher derivatives. Answer: 


LS ur) i 


E = ;ve( 


5.117 
5 [5.117] 


(b) Discuss the limiting case kgT < hw. 


(c) Discuss the classical limit, kgT >> hc, in the light of the equipartition 
theorem?! How many degrees of freedom does a particle in the three 
dimensional harmonic oscillator possess? 


3! See, for example, Halliday and Resnick (footnote 27). Section 20-9. 


PART II APPLICATIONS 


CHAPTER 6 


TIME-INDEPENDENT 
PERTURBATION THEORY 


6.1 NONDEGENERATE PERTURBATION THEORY 


6.1.1 General Formulation 


Suppose we have solved the (time-independent) Schródinger equation for some 
potential (say, the one-dimensional infinite square well): 


0,,.0 0,,0 
H V, = E, V, . [6. 1] 
obtaining a complete set of orthonormal eigenfunctions, y?, 


(Oy) = dum. [6.2] 


and the corresponding eigenvalues EQ. Now we perturb the potential slightly (say, 
by putting a little bump in the bottom of the well—Figure 6.1). We'd like to find 
the new eigenfunctions and eigenvalues: 


H yn = Envy, [6.3] 


but unless we are very lucky, we're not going to be able to solve the Schródinger 
equation exactly, for this more complicated potential. Perturbation theory is a 
systematic procedure for obtaining approximate solutions to the perturbed problem, 
by building on the known exact solutions to the unperturbed case. 
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V(x) 


FIGURE 6.1: Infinite square well with small perturbation. 


To begin with we write the new Hamiltonian as the sum of two terms: 
H = H°+)H', [6.4] 


where H' is the perturbation (the superscript 0 always identifies the unperturbed 
quantity). For the moment we'll take A to be a small number; later we'll crank it 
up to 1, and H will be the true Hamiltonian. Next we write yy, and E, as power 
series in A: 
0 b 442.52 

Vn = v, TÀV, +i V, Tee : [6.5] 

Ey = EF AE, MES +e. [6.6] 
Here E] is the first-order correction to the nth eigenvalue, and y! is the first-order 
correction to the nth eigenfunction; E? and yŻ are the second-order corrections, 
and so on. Plugging Equations 6.5 and 6.6 into Equation 6.3, we have: 
(HO + AA HAW, A s too] 

= (Ep HAE, HA En + tn FAW, eM bd. 


or (collecting like powers of i): 


Hy HACH? Yy + H'a) EX OI + Hh) e 
= Ep Yn + MEW Wn + En Vn) + EnV + Ena + Ela) se | 
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To lowest order! (4°) this yields H°w® = E°y°, which is nothing new (Equa- 
tion 6.1). To first order (A), 


Hy + Hn = Ent, Ey [6.7] 
To second order (A2), 
Hya + Hy = Eja + E Wa + En Vn [6.8] 


and so on. (I’m done with A, now—it was just a device to keep track of the 
different orders—so crank it up to 1.) 


6.1.2 First-Order Theory 


Taking the inner product of Equation 6.7 with y? (that is, multiplying by (wp) 
and integrating), 


(Wn lH rg) + RL Un) = En hoo) + Ep dlp) 


But H is hermitian, so 
(We Hod!) = (Hove ly) = (EL yll) = Eyll), 


and this cancels the first term on the right. Moreover, (wo |W) = 1l, so? 


E, = (We lH’|e). [6.9] 


This is the fundamental result of first-order perturbation theory; as a practical 
matter, it may well be the most important equation in quantum mechanics. It 
says that the first-order correction to the energy is the expectation value of the 
perturbation, in the unperturbed state. 


Example 6.1 The unperturbed wave functions for the infinite square well are 


(Equation 2.28) 
VO x) = y * sin (=x) 
n Va qw 


! As always (Chapter 2. footnote 25) the uniqueness of power series expansions guarantees that 
the coefficients of like powers are equal. 


?[n this context it doesn't matter whether we write (yo |H 'y9) or (WOH wo) (with the extra. 
vertical bar), because we are using the wave function itself to “label” the state. But the latter notation 
is preferable, because it frees us from this specific convention. 
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V(x) 


FIGURE 6.2: Constant perturbation over the whole well. 


Suppose we perturb the system by simply raising the "floor" of the well a constant 
amount Vo (Figure 6.2). Find the first-order correction to the energies. 


Solution: In this case H’ = Vo, and the first-order correction to the energy of the 
nth state is 


E! = (WPI Volly?) = Vo(woly®) = Vo. 


The corrected energy levels, then, are E, = E? + Vo; they are simply lifted by the 
amount Vo. Of course! The only surprising thing is that in this case the first-order 
theory yields the exact answer. Evidently for a constant perturbation all the higher 
corrections vanish.? On the other hand, if the perturbation extends only half-way 
across the well (Figure 6.3), then 


2Vg [9/7 : V 
El = zu sin? (Zx) dx = a 
a 0 a 2 


In this case every energy level is lifted by Vo/2. That's not the exact result, 
presumably, but it does seem reasonable, as a first-order approximation. 


Equation 6.9 is the first-order correction to the energy; to find the first-order 
correction to the wave function we first rewrite Equation 6.7: 


(H9 — EX), = -(H' — Ej yyy. [6.10] 


?Incidentally. nothing here depends on the specific nature of the infinite square well —the same 
holds for any potential. when the perturbàtion is constant. 


Section 6.1: Nondegenerate Perturbation Theory 253 


FIGURE 6.3: Constant perturbation over half the well. 


The right side is a known function, so this amounts to an inhomogeneous differential 
equation for y]. Now, the unperturbed wave functions constitute a complete set, 
so V] (like any other function) can be expressed as a linear combination of them: 


Vs =F Vn. [6.11] 
mn 


There is no need to include m = n in the sum, for if y! satisfies Equation 6.10, so 
too does (y! + aD), for any constant o, and we can use this freedom to subtract 
off the y? term.* If we could determine the coefficients ct", we'd be done. 

Well, putting Equation 6.11 into Equation 6.10, and using the fact that the y? 


satisfies the unperturbed Schródinger equation (Equation 6.1), we have 


> Em m En) eu = =(H T Ej). 
mn 


Taking the inner product with VP, 


Yo (En — EDD Wow) = PIR wo) + E op bhp). 
msn 


? Alternatively, a glance at Equation 6.5 reveals that any vo component in y might as well 
be pulled out and combined with the first term. In fact, the choice c, 0D = 0 ensures that wn—with 1 
as the coefficient of VO in Equation 6.5—is normalized (to first order in A): (Wnlvin) = (Wipe) + 
ACW WO) + (WELW!) + A36.) +--+, but the orthonormality of the unperturbed states means that 


the first term is 1 and (y! iy D = (wv!) = 0, as long as w! has no w° component. 
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If / =n, the left side is zero, and we recover Equation 6.9; if | 4 n, we get 


(E? — ED c = — (ufa nuo), 


or 
() H' 0 
cm = Em Wa) Mat, [6.12] 
n m 
so 
0 H' 0 
y! = » (Wal Wn) yo, [6.13] 


(En — Em) 


mÆn 


Notice that the denominator is safe (since there is no coefficient with m = n) as 
long as the unperturbed energy spectrum is nondegenerate. But if two different 
unperturbed states share the same energy, we’re in serious trouble (we divided by 
zero to get Equation 6.12); in that case we need degenerate perturbation theory, 
which I'll come to in Section 6.2. 

That completes first-order perturbation theory: The first-order correction to 
the energy, E], is given by Equation 6.9, and the first-order correction to the 
wave function, V, is given by Equation 6.13. I should warn you that whereas 
perturbation theory often yields surprisingly accurate energies (that is, E? 4 El is 


quite close to the exact value E), the wave functions are notoriously poor. 


xProblem 6.1 Suppose we put a delta-function bump in the center of the infinite 
square well: 


H’ = aó(x —a/2), 
where o is a constant. 


(a) Find the first-order correction to the allowed energies. Explain why the ener- 
gies are not perturbed for even n. 


(b) Find the first three nonzero terms in the expansion (Equation 6.13) of the 
correction to the ground state, yi. 


xProblem 6.2 For the harmonic oscillator [V (x) = (1/2)kx?], the allowed ener- 
gies are 


E, = (n+ 1/2ħw, (n=0.1.2,...), 


where w = yk/m is the classical frequency. Now suppose the spring constant 
increases slightly: k — (1 + €)k. (Perhaps we cool the spring, so it becomes less 
flexible.) 


Section 6.1: Nondegenerate Perturbation Theory 255 


(a) Find the exact new energies (trivial, in this case). Expand your formula as a 
power series in €, up to second order. 


(b) Now calculate the first-order perturbation in the energy, using Equation 6.9. 
What is H’ here? Compare your result with part (a). Hint: It is not neces- 
sary —in fact, it is not permitted —to calculate a single integral in doing this 
problem. 


Problem 6.3 Two identical bosons are placed in an infinite square well 
(Equation 2.19). They interact weakly with one another, via the potential 


V (x1, x2) = —a Voĝ (xı — x2) 


(where Vo is a constant with the dimensions of energy, and a is the width of 
the well). 


(a) First, ignoring the interaction between the particles, find the ground state 
and the first excited state—both the wave functions and the associated 
energies. 


(b) Use first-order perturbation theory to estimate the effect of the particle- 
P ry P 
particle interaction on the energies of the ground state and the first excited 
state. 


6.1.3 Second-Order Energies 


Proceeding as before, we take the inner product of the second order equation 
(Equation 6.8) with V9: 


(Wel wt) + (WRIA Wh) = Eb) + En GOAL) + Ez (01D). 
Again, we exploit the hermiticity of H: 
(/21H9y2) = (Howe ln) = EL pla), 


so the first term on the left cancels the first term on the right. Meanwhile, 
(y 9|u9) = 1, and we are left with a formula for E?: 


E; = (pH Ia) — E) Wel). [6.14] 


But 


UM Vi) =}, eu UA Vin) = = 
mn 
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(because the sum excludes m — n, and all the others are orthogonal), so 


(WRIA We) (Wl we ) 
E? -( OA wl) = y ce v9 LR" |y9) = Y m ms zo m 
mn mn n 


or, finally., 


EN aie O K [Walla PW [6.15] 


mn 


This is the fundamental result of second-order perturbation theory. 

We could go on to calculate the second-order correction to the wave func- 
tion (Y2), the third-order correction to the energy, and so on, but in practice 
Equation 6.15 is ordinarily as far as it is useful to pursue this method.> 


*Problem 6.4 


(a) Find the second-order correction to the energies (E?) for the potential 
in Problem 6.1. Comment: You can sum the series explicitly, obtaining 
—2m(a/mhn)* for odd n. 


(b) Calculate the second-order correction to the ground state energy (EG) for the 
potential in Problem 6.2. Check that your result is consistent with the exact 
solution. 


x *Problem 6.5 Consider a charged particle in the one-dimensional harmonic oscil- 


lator potential. Suppose we turn on a weak electric field (E), so that the potential 
energy is shifted by an amount H' = —qEx. 


?In the short-hand notation Vj; = (VO1B'|yD). Aun = Eo 
to the nth energy are 


0 — EQ, the first three corrections 


? [Vin E 3 Vil V 1 Vinn Vim ig 
El =V, E2 = ——. Ev = ————— - V, — 
a i n » Aun " »» Anl Anm m 2. Ain 


lane meu 


The third order correction is given in Landau and Lifschitz, Quantum Mechanics: Non-Relativistic 
Theory, 3rd ed., Pergamon. Oxford (1977), page 136: the fourth and fifth orders (together with a 
powerful general technique for obtaining the higher orders) are developed by Nicholas Wheeler, 
Higher-Order Spectral Perturbation (unpublished Reed College report, 2000). Illuminating alterna- 
tive formulations of time-independent perturbation theory include the Delgarno-Lewis method and 
the closely related “logarithmic” perturbation theory (see, for example, T. Imbo and U. Sukhatme. 
Am. J. Phys. 52. 140 (1984), for LPT, and H. Mavromatis, Am. J. Phys. 59, 738 (1991), for Delgarno- 
Lewis). 
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(a) Show that there is no first-order change in the energy levels, and calculate 
the second-order correction. Hint: See Problem 3.33. 


(b) The Schródinger equation can be solved directly in this case, by a change of 
variables: x’ = x — (qE/ mw). Find the exact energies, and show that they 
are consistent with the perturbation theory approximation. 


6.2 DEGENERATE PERTURBATION THEORY 


If the unperturbed states are degenerate—that is, if two (or more) distinct states 
(y and Vp) share the same energy —then ordinary perturbation theory fails: ce 
(Equation 6.12) and E? (Equation 6.15) blow up (unless, perhaps, the numerator 
vanishes, (VD H'|y9) = 0—a loophole that will be important to us later on). In the 
degenerate case, therefore, there is no reason to trust even the first-order correction 
to the energy (Equation 6.9), and we must look for some other way to handle the 


problem. 


6.2.1 Two-Fold Degeneracy 
Suppose that 


Wy =E Wa. Wb, = Ep. (Waltp) =0. [6.16] 
with y? and y? both normalized. Note that any linear combination of these states, 


V? — ay + BY). [6.17] 
is still an eigenstate of H9, with the same eigenvalue E: 
Hy? = Ey’. [6.18] 


Typically, the perturbation (H^) will "break" (or "lift") the degeneracy: As we 
increase À (from O to 1), the common unperturbed energy E splits into two 
(Figure 6.4). Going the other direction, when we turn off the perturbation, the 
"upper" state reduces down to one linear combination of y? and Vp, and the 
“lower” state reduces to some orthogonal linear combination, but we don't know a 
priori what these “good” linear combinations will be. For this reason we can't even 
calculate the first-order energy (Equation 6.9) — we don't know what unperturbed 
states to use. 

For the moment, therefore, let's just write the "good" unperturbed states in 
generic form (Equation 6.17), keeping œ and f adjustable. We want to solve the 
Schródinger equation, 

Hy = Ey, [6.19] 
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E 


Eo 


1 À 


FIGURE 6.4: “Lifting” of a degeneracy by a perturbation. 


with H = H? -- AH' and 
E-E EAE +E +>, PEP Ay! XN Sun [6.20] 


Plugging these into Equation 6.19, and collecting like powers of A (as before) 
we find 


Hy? + A(H'J9 + Hoy!) p. = EYO ACE JO + Boy!) eee. 
But H949 = E°y° (Equation 6.18), so the first terms cancel; at order A! we have 
H9y! +H’ yp = E9! + El y. [6.21] 
Taking the inner product with Oe 
(yal H Y!) + (pal HY’) = E (halt!) + El Wal). 
Because H? is hermitian, the first term on the left cancels the first term on 


the right. Putting in Equation 6.17 and exploiting the orthonormality condition 
(Equation 6.16), we obtain 


o (9| H9) + BIH’ Iy?) = eE!. 


or, more compactly, 
aWag + BWap = aE!, [6.22] 


where 
Wij = (WEIA WW), Gj — a.b). [6.23] 
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Similarly, the inner product with Vp yields 
o Wb, + BWop = BE". [6.24] 


Notice that the W's are (in principle) known —they are just the "matrix ele- 
ments" of H', with respect to the unperturbed wave functions y? and Vp. Multi- 
plying Equation 6.24 by Wan, and using Equation 6.22 to eliminate 8 Wap, we find: 


o [Was Wha — (E! — Waa)(E! — Wop)] = 0. [6.25] 
If œ is not zero, Equation 6.25 yields an equation for E!: 


(El)? — E! (Waa + Wop) + (Waa Wop — Wab Wha) = 0. [6.26] 


* 
ab* 


Invoking the quadratic formula, and noting (from Equation 6.23) that Wy, = 
we conclude that 


l / 
El = 2 [Waa + Wop ate (Waa = Wop)? + 4|Wapl? | . [6.27] 


This is the fundamental result of degenerate perturbation theory: the two roots 
correspond to the two perturbed energies. 

But what if œ is zero? In that case B = 1, Equation 6.22 says Way = O, 
and Equation 6.24 gives E! = Wpy. This is actually included in the general result 
(Equation 6.27), with the minus sign (the plus sign corresponds to a = 1, B = 0). 
What's more, the answers, 


E} = Waa = (YÌIH' Iy’), EL = Wy = (WRIA D). 


are precisely what we would have obtained using nondegenerate perturbation theory 
(Equation 6.9). We have simply been /ucky: The states y? and v, were already 
the “good” linear combinations. Obviously, it would be greatly to our advantage 
if we could somehow guess the “good” states right from the start —then we could 
go ahead and use nondegenerate perturbation theory. As it turns out, we can very 
often do this by exploiting the following theorem: 


Theorem: Let A be a hermitian operator that commutes with H? and H’. 


If y? and y? (the degenerate eigenfunctions of H?) are also eigenfunctions 
of A, with distinct eigenvalues, 


Ay? = uy. Aw? = vw. and u Æ v. 
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then W,,, = 0 (and hence y? and y? are the “good” states to use in pertur- 
bation theory). 


Proof: By assumption, [A. H'] = 0, so 
(JP LA. H']yg) =0 
= (VS|AH'yp) — (V9 |H'Av]) 
= (AY IA’ Wp) — (yg a'vyp) 
= (u — v)(WolA WP) = (u — v) Wap. 


But u Æ v, so Wap = 0. QED 


Moral: If you’re faced with degenerate states, look around for some hermitian 
operator A that commutes with H? and H'; pick as your unperturbed states ones 
that are simultaneously eigenfunctions of H and A. Then use ordinary first-order 
perturbation theory. If you can’t find such an operator, you’ll have to resort to 
Equation 6.27, but in practice this is seldom necessary. 


Problem 6.6 Let the two “good” unperturbed states be 
Wh =a We + Bap. 


where a+ and 6+ are determined (up to normalization) by Equation 6.22 (or Equa- 
tion 6.24). Show explicitly that 


(a) VO are orthogonal (V |y?) = 0); 


(b) (VS LH |y) = 0: 
(c) (V9. |H'|y9 ) = El, with El, given by Equation 6.27. 


Problem 6.7 Consider a particle of mass m that is free to move in a one-dimen- 
sional region of length L that closes on itself (for instance, a bead that slides 
frictionlessly on a circular wire of circumference L, as in Problem 2.46). 


(a) Show that the stationary states can be written in the form 


Wn (x) = ae (—L/2 <x < L/2). 
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where n = 0, +1, +2, ..., and the allowed energies are 


2 (at) 
E, = — op . 
m L 


Notice that— with the exception of the ground state (n = 0)—these are all 
doubly degenerate. 


(b) Now suppose we introduce the perturbation 
H' Em —Vge^* l 


where a < L. (This puts a little “dimple” in the potential at x = 0, as though 
we bent the wire slightly to make a “trap.”) Find the first-order correction to 
En, using Equation 6.27. Hint: To evaluate the integrals, exploit the fact that 
a « L to extend the limits from +L/2 to too; after all, H’ is essentially 
zero outside —a « x « a. 


(c) What are the “good” linear combinations of w, and y-n, for this problem? 
Show that with these states you get the first-order correction using 
Equation 6.9. 


(d) Find a hermitian operator A that fits the requirements of the theorem, and 
show that the simultaneous eigenstates of H? and A are precisely the ones 
you used in (c). 


6.2.2 Higher-Order Degeneracy 


In the previous section I assumed the degeneracy was two-fold, but it is easy to 
see how the method generalizes. Rewrite Equations 6.22 and 6.24 in matrix form: 


Waa Wab\(a\ _ fif 
te a (;)-5 a: [6.28] 


Evidently the E!’s are nothing but the eigenvalues of the W-matrix; Equation 6.26 
is the characteristic equation for this matrix, and the "good" linear combinations 
of the unperturbed states are the eigenvectors of W. 
In the case of n-fold degeneracy, we look for the eigenvalues of the n x n 
matrix 
Wi; = (VPIH' ly). [6.29] 


In the language of linear algebra, finding the "good" unperturbed wave functions 
amounts to constructing a basis in the degenerate subspace that diagonalizes the 
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matrix W. Once again, if you can think of an operator A that commutes with H', 
and use the simultaneous eigenfunctions of A and H, then the W matrix will 
automatically be diagonal, and you won't have to fuss with solving the charac- 
teristic equation. (If you're nervous about my casual generalization from 2-fold 
degeneracy to n-fold degeneracy, work Problem 6.10.) 


Example 6.2 Consider the three-dimensional infinite cubical well (Problem 4.2): 


0, ifOcx .0 1. andO « z : 
Visto if M «a.0«v«a. an <2< a: [6.30] 
co otherwise. 


The stationary states are 


23 ny nm n-cX 
0 " : E : y . gc 
Vy ais )-2) = (=) sin ( : x) sin (= y) sin (= z) ; [6.31] 


a 


where nx, ny, and n; are positive integers. The corresponding allowed energies are 


2&2 


“Re 


h 
E? - nl + n? + n2). [6.32] 


mn: 2ma 


Notice that the ground state (111) is nondegenerate; its energy is 


zh? 
Ey 35 [6.33] 
But the first excited state is (triply) degenerate: 
Va = Vui. We = Via. and ye = yan. [6.34] 
all share the same energy e 
Eat. [6.35] 
ma 
Now let's introduce the perturbation 
H= i ae « a/2 and 0 « y « a/2; [6.36] 
0. otherwise. 


ÓDegenerate perturbation theory amounts to diagonalization of the degenerate part of the Hamil- 
tonian. The diagonalization of matrices (and simultaneous diagonalizability of commuting matrices) is 
discussed in the Appendix (Section A.5). 


Section 6.2: Degenerate Perturbation Theory 263 


FIGURE 6.5: The perturbation increases the potential by an amount Vy in the shaded 
sector. 


This raises the potential by an amount Vo in one quarter of the box (see Figure 6.5). 
The first-order correction to the ground state energy is given by Equation 6.9: 


E} = (Win lA’ Win) 
2\3 4/2. | a/2 , m a ym 
= (=) v f sin (5x) ax f sin (5) av f sin (=z) dz 
1 
= -Vp, 6.3 
40: [6.37] 


which is pretty much what we would expect. 

For the first excited state we need the full machinery of degenerate perturbation 
theory. The first step is to construct the matrix W. The diagonal elements are the 
same as for the ground state (except that the argument of one of the sines is 
doubled); you can check for yourself that 


l 
Waa = Wop = Wee = 4 YO: 


The off-diagonal elements are more interesting: 
aXe ule m 
Wap = (=) vo | sin? (=x) dx 
a 0 a 
BEL. E .(?x a (2r . [T 
x sin (=y) sin | —y] dy sin | —z } sin (=z) dz. 
0 a a 0 a ea 
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But the z integral is zero (as it will be also for Wac), so 
Wab = Wac = 0. 
Finally, 


2 3 a/2 2 
Wie = (=) vo | sin (=x) sin (Z) dx 
a 0 a a 
20 


Thus 
Vo 1 0 0 
Wo 0 1k [6.38] 
O0 x 1 


where k = (8/37)? © 0.7205. 
The characteristic equation for W (or rather, for 4W/ Vo, which is easier to 
work with) is 


(1— wY —?(1— w) =0. 
and the eigenvalues are 
wj—l; wr=1+x 7 1.705; w3=1-—Kk % 0.2795. 
To first order in A, then, 
E? + 1 V/A, 
E\(A) = { E? +A(1 + k)Vo/4. [6.39] 
E? +A(1 — k)Vo/4. 
where E? is the (common) unperturbed energy (Equation 6.35). The perturbation 
lifts the degeneracy, splitting E? into three distinct energy levels (see Figure 6.6). 
Notice that if we had naively applied nondegenerate perturbation theory to this 
problem, we would have concluded that the first-order correction (Equation 6.9) is 
the same for all three states, and equal to Vo/4— which is actually correct only for 


the middle state. 
Meanwhile, the “good” unperturbed states are linear combinations of the form 


y? = oa + Bl +y Vie. [6.40] 


where the coefficients (a, 8, and y) form the eigenvectors of the matrix W: 


1 0 0 a a 
0 l1 x Bl -wv|fg 
Ok 1 y y 


nS E S 
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E,(A) 


1 À 


FIGURE 6.6: Lifting of the degeneracy in Example 6.2 (Equation 6.39). 


For w = 1 we get œ = 1, 8h = y = 0; forw=1t«K we get œ = 0, $ = ty = 
1/4/2. (I normalized them as I went along.) Thus the “good” states are’ 


Wa: 
Wo = d (Wo + V/A. [6.41] 
(Wo — We) /V2. 


Problem 6.8 Suppose we perturb the infinite cubical well (Equation 6.30) by 
putting a delta function “bump” at the point (a/4, a/2, 3a/4): 


H' = a? Vod(x — a/4)8(y — a/2)8(z — 3a /4). 


Find the first-order corrections to the energy of the ground state and the (triply 
degenerate) first excited states. 


7We might have guessed this result right from the start by noting that the operator Pyy. which 
interchanges x and y. commutes with H’. Its eigenvalues are +1 (for functions that are even under the 
interchange), and —1 (for functions that are odd). In this case Wy is already even. (Yp + We) is even, 
and (Vj — We) is odd. This is not quite conclusive, since any linear combination of the even states 
would still be even. But if we also use the operator Q, which takes z to a — z, and note that y, is 
an eigenfunction with eigenvalue —1, whereas the other two are eigenfunctions with eigenvalue +1, 
the ambiguity is resolved. Here the operators Pyy and Q together play the role of A in the theorem of 
Section 6.2.1. 
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xProblem 6.9 Consider a quantum system with just three linearly independent 
states. Suppose the Hamiltonian, in matrix form, is 


(l—e) 0 0 
H = Vo 0 l e 
0 e 2 


where Vo is a constant, and € is some small number (e < 1). 


(a) Write down the eigenvectors and eigenvalues of the unperturbed Hamiltonian 
(e — 0). 


(b) Solve for the exact eigenvalues of H. Expand each of them as a power series 
in €, up to second order. 


(c) Use first- and second-order nondegenerate perturbation theory to find the 
approximate eigenvalue for the state that grows out of the nondegenerate 
eigenvector of H 0. Compare the exact result, from (a). 


(d) Use degenerate perturbation theory to find the first-order correction to the 
two initially degenerate eigenvalues. Compare the exact results. 


Problem 6.10 In the text I asserted that the first-order corrections to an n-fold 
degenerate energy are the eigenvalues of the W matrix, and I justified this claim as 
the "natural" generalization of the case n — 2. Prove it, by reproducing the steps 
in Section 6.2.1, starting with 


n 
0 0 
LA =) apy; 
j=l 


(generalizing Equation 6.17), and ending by showing that the analog to 
Equation 6.22 can be interpreted as the eigenvalue equation for the matrix W. 


6.3 THE FINE STRUCTURE OF HYDROGEN 


In our study of the hydrogen atom (Section 4.2) we took the Hamiltonian to be 


hz 2 
i | E [6.42] 


H=-—vV*- 
2m 4neor 


(electron kinetic energy plus coulombic potential energy). But this is not quite 
the whole story. We have already learned how to correct for the motion of the 
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TABLE 6.1: Hierarchy of corrections to the 
Bohr energies of hydrogen. 


Bohr energies: oforder  o?mc? 
Fine structure: oforder omc? 


Lamb shift: of order  o?mc? 
Hyperfine splitting: of order (m/m,)a 


4 p2 


mec 


nucleus: Just replace m by the reduced mass (Problem 5.1). More significant is 
the so-called fine structure, which is actually due to two distinct mechanisms: a 
relativistic correction, and spin-orbit coupling. Compared to the Bohr energies 
(Equation 4.70), fine structure is a tiny perturbation— smaller by a factor of o?, 
where 
2 
e^ 1 


^ 


4reéohic 137.036 


a= [6.43] 
is the famous fine structure constant. Smaller still (by another factor of œ) is the 
Lamb shift, associated with the quantization of the electric field, and smaller by 
yet another order of magnitude is the hyperfine structure, which is due to the 
magnetic interaction between the dipole moments of the electron and the proton. 
This hierarchy is summarized in Table 6.1. In the present section we will analyze 
the fine structure of hydrogen, as an application of time-independent perturbation 
theory. 


Problem 6.11 


(a) Express the Bohr energies in terms of the fine structure constant and the rest 
energy (mc?) of the electron. 


(b 


~ 


` Calculate the fine structure constant from first principles (i.e., without recourse 
to the empirical values of €o, e, fi, and c). Comment: The fine structure con- 
stant is undoubtedly the most fundamental pure (dimensionless) number in all 
of physics. It relates the basic constants of electromagnetism (the charge of 
the electron), relativity (the speed of light), and quantum mechanics (Planck's 
constant). If you can solve part (b), you have the most certain Nobel Prize 
in history waiting for you. But I wouldn't recommend spending a lot of time 
on it right now; many smart people have tried, and all (so far) have failed. 


6.3.1 The Relativistic Correction 


The first term in the Hamiltonian is supposed to represent kinetic energy: 


1 
T= jmv = —, [6.44] 
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and the canonical substitution p — (A/i)V yields the operator 


A^ s 
T 2——V*. [6.45] 
2m 
But Equation 6.44 is the classical expression for kinetic energy; the relativistic 
formula is 5 
C= a tegi: [6.46] 


V1 — (vc? 


The first term is the toral relativistic energy (not counting potential energy, 
which we aren't concerned with at the moment), and the second term is the rest 
energy—the difference is the energy attributable to motion. 

We need to express T in terms of the (relativistic) momentum, 


mv 


P = ———c, [6.47] 
v1- (v/c 
instead of velocity. Notice that 
2,2,2 24 2 2,4 
22 24 m^vc-m^c[1— (v/o)] merc 23 
9 2 L———————— zm —————s-(qT 242 
pc +m ce TEES E (T 4- mc*) 


SO 
T = p2c? + m2c4 -mc. [6.48] 


This relativistic equation for kinetic energy reduces (of course) to the classical 
result (Equation 6.44), in the nonrelativistic limit p « mc; expanding in powers 
of the small number (p/mc), we have j 


py? 2 lp pae. dy 
= ic? 1 =) -lj|= come] B = =) — — 2) e.. — Í 
HIC a (Z i | i 2 e 8 E 


p? pî 
= — — 6.49 
2m | 8m?c? [6.43] 

The lowest-order® relativistic correction to the Hamiltonian is evidently 
4 
p 

H, = -—. 6.50 
&n?c? [6.50] 


8The kinetic energy of the electron in hydrogen is on the order of 10 eV. which is miniscule 
compared to its rest energy (511.000 eV). so the hydrogen atom is basically nonrelativistic. and we 
can afford to keep. only the lowest-order correction. In Equation 6.49. p is the relativistic momentum 
(Equation 6.47), nor the classical momentum mv. It is the former that we now associate with the 
quantum operator —i/ V, in Equation 6.50. 
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In first-order perturbation theory, the correction to E, is given by the expec- 
tation value of H’ in the unperturbed state (Equation 6.9): 


! l 2 2 
E! = (H) == -3a Wie) = -azza U Vp y). [6.51] 


Now, the Schrödinger equation (for the unperturbed states) says 


ply =2m(E — Vy, [6.52] 
and hence? 
El = — i (E—vyyz- | [Eh —2E(V) + (V3). [6.53] 
mc 2mc* 


So far this is entirely general; but we're interested in hydrogen, for which V(r) = 
—(1/4z eye? [r: 


ise Eas (<5 ‘+ (5 (2 [6.54] 
r 2mc " Veo) Vr 47t €9 rl i 


where E, is the Bohr energy of the state in question. 
To complete the job, we need the expectation values of 1/r and 1/r?, in the 
(unperturbed) state V/4j,, (Equation 4.2.1). The first is easy (see Problem 6.12): 


DM gs 


n^a 


where a is the Bohr radius (Equation 4.72). The second is not so simple to derive 
(see Problem 6.33), but the answer is!? 


1 1 | 
(5) ~ (+ 1/2)n3 a2’ ud 


It follows that 


1 e \ 1 e x^ 1 
E! = —— | E? + 2E, EAO Pads See oe el 
i 2mc? = (zz) n?a m (zz) (i+ rad 


?There is some sleight-of-hand in this maneuver, which exploits the hermiticity of p? and of 
(E — V). In truth, the operator p is not hermitian for states with ? = 0 (see Problem 6.15), and 
the applicability of perturbation theory to Equation 6.50 is therefore called into question (for the case 
| = 0). Fortunately. the exact solution is available; it can be obtained by using the (relativistic) Dirac 
equation in place of the (nonrelativistic) Schródinger equation, and it confirms the results we obtain 
here by less rigorous means (see Problem 6.19). 

l0The general formula for the expectation value of any power of r is given in Hans A. Bethe and 
Edwin E. Salpeter, Quantum Mechanics of One- and Two-Electron Atoms, Plenum, New York (1977), 
p. 17. 
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or, eliminating a (using Equation 4.72) and expressing everything in terms of E, 
(using Equation 4.70): 


(En) [ 4n 
E! =— ee 5 
f 2mc? È 1/2 3] d 


Evidently the relativistic correction is smaller than Ej, by a factor of about 
E,/mc? =2 x 1075. 

You might have noticed that I used nondegenerate perturbation theory in this 
calculation (Equation 6.51), in spite of the fact that the hydrogen atom is highly 
degenerate. But the perturbation is spherically symmetrical, so it commutes with 
L? and L-. Moreover, the eigenfunctions of these operators (taken together) have 
distinct eigenvalues for the n? states with a given E,. Luckily, then, the wave 
functions Whim are the “good” states for this problem (or, as we say, n, l, and 
m are the good quantum numbers), so as it happens the use of nondegenerate 
perturbation theory was legitimate (see the “Moral” to Section 6.2.1). 


xProblem 6.12 Use the virial theorem (Problem 4.40) to prove Equation 6.55. 


Problem 6.13 In Problem 4.43 you calculated the expectation value of r* in the 


state W321. Check your answer for the special cases s = O (trivial), s = —1 
(Equation 6.55), s = —2 (Equation 6.56), and s = —3 (Equation 6.64). Comment 
on the case s — —7. 


x xProblem 6.14 Find the (lowest-order) relativistic correction to the energy levels of 
the one-dimensional harmonic oscillator. Hint: Use the technique in Example 2.5. 


* x «Problem 6.15 Show that p? is hermitian, but p^ is not, for hydrogen states with 
| — 0. Hint: For such states y is independent of 0 and ¢, so 


?. hÀ dd 
im r? dr "dr 


(Equation 4.13). Using integration by parts, show that 


(flp?g) = -4n h? (7 ls ir) | + (G^ fg). 
r ] 0 


Check that the boundary term vanishes for y,99, which goes like 


1 
Vn00 ~ ay exp(—r/na) 
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near the origin. Now do the same for p^, and show that the boundary terms do not 
vanish. In fact: 

8h^* (n — m) 
at (nm)? 


(Wnool P Wmoo) = + (p WnoolWmoo). 


6.3.2 Spin-Orbit Coupling 


Imagine the electron in orbit around the nucleus; from the electron’s point of view, 
the proton is circling around it (Figure 6.7). This orbiting positive charge sets up 
a magnetic field B, in the electron frame, which exerts a torque on the spinning 
electron, tending to align its magnetic moment (jz) along the direction of the field. 
The Hamiltonian (Equation 4.157) is 


To begin with, we need to figure out the magnetic field of the proton (B) and the 
dipole moment of the electron (4L). 


The magnetic field of the proton. If we picture the proton (from the elec- 
tron’s perspective) as a continuous current loop (Figure 6.7), its magnetic field can 
be calculated from the Biot-Savart law: 


poe. 
2r 
with an effective current J = e/T, where e is the charge of the proton and T is 
the period of the orbit. On the other hand, the orbital angular momentum of the 
electron (in the rest frame of the nucleus) is L — rmv — 2mnr? / T. Moreover, B 
and L point in the same direction (up, in Figure 6.7), so 
1 e 


[6.59] 


4z eo mc?r? 


(I used c = 1/./eouo to eliminate jo in favor of €g.) 


B,L 


——* FIGURE 6.7: Hydrogen atom, from the 
+e electron’s perspective. 


272 


Chapter 6 Time-Independent Perturbation Theory 


The magnetic dipole moment of the electron. The magnetic dipole moment 
of a spinning charge is related to its (spin) angular momentum; the proportionality 
factor is the gyromagnetic ratio (which we already encountered in Section 4.4.2). 
Let’s derive it, this time, using classical electrodynamics. Consider first a charge 
q smeared out around a ring of radius r, which rotates about the axis with period 
T (Figure 6.8). The magnetic dipole moment of the ring is defined as the current 
(q/ T) times the area (srr): 


> 
ame 
H—UT 


If the mass of the ring is m, its angular momentum is the moment of inertia (mr?) 
times the angular velocity (2x / T): 


2z mr? 
DEEP: 

The gyromagnetic ratio for this configuration is evidently u/S = q/2m. Notice 
that it is independent of r (and T). If I had some more complicated object, such 
as a sphere (all I require is that it be a figure of revolution, rotating about its 
axis), I could calculate 4 and S by chopping it into little rings, and adding up 
their contributions. As long as the mass and the charge are distributed in the same 
manner (so that the charge-to-mass ratio is uniform), the gyromagnetic ratio will 
be the same for each ring, and hence also for the object as a whole. Moreover, the 
directions of 44 and S are the same (or opposite, if the charge is negative), so 


n7 (3,)8 


That was a purely classical calculation, however; as it turns out the electron's 
magnetic moment is twice the classical value: 


uoce [6.60] 


m 


FIGURE 6.8: A ring of charge, rotating 
about its axis. 
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The “extra” factor of 2 was explained by Dirac, in his relativistic theory of the 
electron.!! 
Putting all this together, we have 


2 
e* l 
H = ———S.L. 
(zz) m2c2r3 


But there is a serious fraud in this calculation: I did the analysis in the rest frame 
of the electron, but that's not an inertial system —it accelerates, as the electron 
orbits around the nucleus. You can get away with this if you make an appropriate 
kinematic correction, known as the Thomas precession.!? In this context it throws 
in a factor of 1/2:? 


E 
e* 1 
H? = | — | — Seb: 6.61 

39 CE 90H 


This is the spin-orbit interaction; apart from two corrections (the modified gyro- 
magnetic ratio for the electron and the Thomas precession factor— which, coinci- 
dentally, exactly cancel one another) it is just what you would expect on the basis 
of a naive classical model. Physically, it is due to the torque exerted on the mag- 
netic dipole moment of the spinning electron, by the magnetic field of the proton, 
in the electron's instantaneous rest frame. 

Now the quantum mechanics. In the presence of spin-orbit coupling, the 
Hamiltonian no longer commutes with L and S, so the spin and orbital angu- 
lar momenta are not separately conserved (see Problem 6.16). However, Hj, does 
commute with L?, S? and the total angular momentum 


J=L+S, [6.62] 


!! We have already noted that it can be dangerous to picture the electron as a spinning sphere (see 
Problem 4.25), and it is not too surprising that the naive classical model gets the gyromagnetic ratio 
wrong. The deviation from the classical expectation is known as the g-factor: jj = g(q/2m)S. Thus 
the g-factor of the electron. in Dirac's theory, is exactly 2. But quantum electrodynamics reveals tiny 
corrections to this: ge is actually 2+ (@/7)+... = 2.002.... The calculation and measurement (which 
agree to exquisite precision) of the so-called anomalous magnetic moment of the electron were among 
the greatest achievements of twentieth-century physics. 


I*One way of thinking of it is that the electron is continually stepping from one inertial system to 
another; Thomas precession amounts to the cumulative effect of all these Lorentz transformations. We 
could avoid the whole problem. of course, by staying in the /ab frame, in which the nucleus is at rest. 
In that case the field of the proton is purely electric. and you might wonder why it exerts any torque 
on the electron. Well, the fact is that a moving magnetic dipole acquires an electric dipole moment, 
and in the lab frame the spin-orbit coupling is due to the interaction of the electric field of the nucleus 
with the electric dipole moment of the electron. Because this analysis requires more sophisticated 
electrodynamics, it seems best to adopt the electron's perspective, where the physical mechanism is 
more transparent. 


I3More precisely, Thomas precession subtracts | from the g factor. See R. R. Haar and 
L. J. Curtis, Am. J. Phys., 55, 1044 (1987). 
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and hence these quantities are conserved (Equation 3.71). To put it another way, 
the eigenstates of L- and S- are not “good” states to use in perturbation theory, 
but the eigenstates of L?, S?, J?, and J. are. Now 


J? = (L+S) - (L+S) = L? +S? +2L.-S, 


so 
2 


1 
L-S= su mq, [6.63] 


and therefore the eigenvalues of L - S are 


2 


h2 
ZUG + 1)-M1 +1) —s(s + 0]. 


In this case, of course, s = 1/2. Meanwhile, the expectation value of 1/ r? (see 
Problem 6.35(c)) is 


l = [6.64] 
r| 1+ 1/2) + Dn?a3' i 
and we conclude that 


| € 1 (DIGI) -— Il +1) -— 3/4] 
— 8eo m?c? Il +1/2)( + 1)n3a3 


or, expressing it all in terms of E,:'4 


2 ose _ = 
E! _ (En) [uu TUER M [6.65] 


so mc? M+ 1/2)(1 4-1) 


It is remarkable, considering the totally different physical mechanisms 
involved, that the relativistic correction and the spin-orbit coupling are of the same 
order (E 7 mc?). Adding them together, we get the complete fine-structure formula 
(see Problem 6.17): 


E)? 4 
pa (3 2 er) | [6.66] 
2mc* j+ 1/2 


Once again. the case / = O is problematic. since we are ostensibly dividing by zero. On the 
other hand, the numerator is also zero. since in this case j = s. so Equation 6.65 is indeterminate. 
On physical grounds there shouldn't be any spin-orbit coupling when / = 0. One way to resolve the 
ambiguity is to introduce the so-called Darwin term (sec. for instance, G. K. Woodgate, Elementary 
Atomic Structure, 2nd ed.. Oxford (1983). p. 63). Serendipitously, even though both the relativistic 
correction (Equation 6.57) and the spin-orbit coupling (Equation 6.65) are questionable in the case 
| — 0. their sum (Equation 6.66) is correct for ail 1 (sce Problem 6.19). 
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n=3 —€— Ó€——— ———————— E e152 


1=0 121 1-2 | 23 
(S) (P) (D) (F) 


FIGURE 6.9: Energy levels of hydrogen, including fine structure (not to scale). 


Combining this with the Bohr formula, we obtain the grand result for the energy 
levels of hydrogen, with fine structure included: 


[6.67] 


2 
d. =m i) 
(m 4/] 


Fine structure breaks the degeneracy in / (that is, for a given n, the different 
allowed values of / do not all carry the same energy), but it still preserves degener- 
acy in j (see Figure 6.9). The z-component eigenvalues for orbital and spin angular 
momentum (mm; and ms) are no longer “good” quantum numbers—the stationary 
states are linear combinations of states with different values of these quantities; the 
“good” quantum numbers are n, l, s, j, and m ;.? 


IST write |j mj) (for given 7 and s) as a linear combination of |/ mj)|s ms} we would use the 
appropriate Clebsch-Gordan coefficients (Equation 4.185). 
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Problem 6.16 Evaluate the following commutators: (a) [L - S, L], (b) [L - S. S], 
(c) [L - S, J], (d) [L-S, L7], (e) [L - S, S?], (f) [L < S. J?]. Hint: L and S satisfy 
the fundamental commutation relations for angular momentum (Equations 4.99 
and 4.134), but they commute with each other. 


xProblem 6.17 Derive the fine structure formula (Equation 6.66) from the relativis- 


tic correction (Equation 6.57) and the spin-orbit coupling (Equation 6.65). Hint: 
Note that j = ! + 1/2; treat the plus sign and the minus sign separately, and you'll 
find that you get the same final answer either way. 


x xProblem 6.18 The most prominent feature of the hydrogen spectrum in the visible 


region is the red Balmer line, coming from the transition n = 3 ton = 2. First of all, 
determine the wavelength and frequency of this line according to the Bohr theory. 
Fine structure splits this line into several closely spaced lines; the question is: How 
many, and what is their spacing? Hint: First determine how many sublevels the 
n = 2 level splits into, and find Ej. for each of these, in eV. Then do the same for 
n = 3. Draw an energy level diagram showing all possible transitions from n = 3 
to n = 2. The energy released (in the form of a photon) is (E3 — E2)+ AE, the first 
part being common to all of them, and the AE (due to fine structure) varying from 
one transition to the next. Find AE (in eV) for each transition. Finally, convert 
to photon frequency, and determine the spacing between adjacent spectral lines (in 
Hz)— ot the frequency interval between each line and the unperturbed line (which 
is, of course, unobservable), but the frequency interval between each line and the 
next one. Your final answer should take the form: "The red Balmer line splits into 
(???) lines. In order of increasing frequency, they come from the transitions (1) 
J = Q9?) to j = (279), (2) j = (277) to j = (???), .... The frequency spacing 
between line (1) and line (2) is (???) Hz, the spacing between line (2) and line (3) 
is (???) Hz, ....” 


Problem 6.19 The exact fine-structure formula for hydrogen (obtained from the 
Dirac equation without recourse to perturbation theory) is!Ó 


31-12 
( a ) 
n— (j +1/2)+ y (j 1/27 — a? 


Expand to order o^ (noting that œ « 1), and show that you recover Equation 6.67. 


Enj = mc? 1+ 


lÓóBethe and Salpeter (footnote. 10). page 238. 
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6.4 THE ZEEMAN EFFECT 


When an atom is placed in a uniform external magnetic field Bex, the energy levels 
are shifted. This phenomenon is known as the Zeeman effect. For a single electron, 
the perturbation is 

H; = —(H T hs) : Bext, [6.68] 


where ; 
h, =——S [6.69] 
m 
is the magnetic dipole moment associated with electron spin, and 


e 


Hi = ——L [6.70] 
2m 
is the dipole moment associated with orbital motion.!” Thus 
Hi = —(L+2S) - Ben. [6.71] 
2m 


The nature of the Zeeman splitting depends critically on the strength of the 
external field in comparison with the internal field (Equation 6.59) that gives rise 
to spin-orbit coupling. If Box, « Bim, then fine structure dominates, and H; can 
be treated as a small perturbation, whereas if Bex, >> Bin, then the Zeeman effect 
dominates, and fine structure becomes the perturbation. In the intermediate zone, 
where the two fields are comparable, we need the full machinery of degenerate 
perturbation theory, and it is necessary to diagonalize the relevant portion of the 
Hamiltonian “by hand." In the following sections we shall explore each of these 
regimes briefly, for the case of hydrogen. 


Problem 6.20 Use Equation 6.59 to estimate the internal field in hydrogen, and 
characterize quantitatively a "strong" and "weak" Zeeman field. 


6.4.1 Weak-Field Zeeman Effect 


If Bex «€ Bim, fine structure dominates (Equation 6.67); the "good" quantum 
numbers are n, /, j, and m; (but not m; and ms, because—in the presence of spin- 
orbit coupling —L and S are not separately conserved).!? In first-order perturbation 


The gyromagnetie ratio for orbital motion is just the classical value (q/2m)—it is only for 
spin that there is an "extra" factor of 2. 


'81n this problem we have a perturbation (Zeeman splitting) piled on top of a perturbation (fine 
structure). The "good" quantum numbers are those appropriate to the dominant perturbation—in this 
case the fine strueture. The secondary perturbation (Zceman splitting) lifts the remaining degencracy in 
J.. which plays here the role of the operator A in the theorem of Section 6.2.1. Technically. J- does 
not commute with H}. but it does in the time average sense of Equation 6.73. 
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FIGURE 6.10: In the presence of spin-orbit cou- 
pling, L and S are not separately conserved; they 
precess about the fixed total angular momentum, J. 


theory, the Zeeman correction to the energy is 
e 


Bex: © (L + 2S). [6.72] 
2m 


EL = (nljm j|Hz|nl jm;) = 


Now L 4- 28 = J+S. Unfortunately, we do not immediately know the expectation 
value of S. But we can figure it out, as follows: The total angular momentum 
J = L + S is constant (Figure 6.10); L and S precess rapidly about this fixed 
vector. In particular, the (time) average value of S is just its projection along J: 


S-J) 
J2 


But L = J — S, so L? = J? + S? — 2J - S, and hence 


J. [6.73] 


Save = 


15s , M 
$-J=5(P +9 -L= ZUG +1)+s(s+1)—-I(0+1)]. 16.74] 


from which it follows that 


5) ) | ijg-D-lg-4 md | 
L+ 2S) =({ 1+ Jj2|14 = I. 6.75 
| ( J? 2j(j +1) e end 
The term in square brackets is known as the Landé g-factor, g;. 
We may as well choose the z-axis to lie along Bex: then 

E, = upg; Bes j. [6.76] 
where 

eh 25 

uB = T 5.788 x 107° eV/T [6.77] 
m 


is the so-called Bohr magneton. The total energy is the sum of the fine-structure 
part (Equation 6.67) and the Zeeman contribution (Equation 6.76). For example, 
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the ground state (n = 1, / = 0, j = 1/2, and therefore g; = 2) splits into two 
levels: 
—13.6 eV(1-- 02/4) t up Bex. [6.78] 


with the plus sign for m; — 1/2, and minus for m; — —1/2. These energies are 
plotted (as functions of Bexi) in Figure 6.11. 


xProblem 6.21 Consider the (eight) n = 2 states, |2/jm;). Find the energy of each 
state, under weak-field Zeeman splitting, and construct a diagram like Figure 6.11 
to show how the energies evolve as Bex, increases. Label each line clearly, and 
indicate its slope. 


6.4.2. Strong-Field Zeeman Effect 


If Bext >> Bim, the Zeeman effect dominates;'? with Be, in the z direction, the 
"good" quantum numbers are now n, /, mj, and ms (but not j and m; because—in 
the presence of the external torque —the total angular momentum is not conserved, 
whereas L. and S. are). The Zeeman Hamiltonian is 


e 
H? = — B L- + 25- s 
25 ex (Lz z) 
E 
HgBext 
—18.6 (1 + 07/4) eV 
m; = -1/2 


FIGURE 6.11: Weak-field Zeeman splitting of the ground state of hydrogen; the 
upper line (mm; = 1/2) has slope 1, the lower line (m; = —1/2) has slope —1. 


I? this regime the Zeeman effeet is also known as the Paschen-Back effect. 
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and the “unperturbed” energies are 


13.6 eV 
Enmms m Ro + MB Bex (ni + 2m;). [6.79] 


That's the answer, if we ignore fine structure completely. But we can do better. 
In first-order perturbation theory the fine-structure correction to these levels is 


EL = (nl mı ms|(H, + H,,)|nl mi ms). [6.80] 


The relativistic contribution is the same as before (Equation 6.57); for the spin-orbit 
term (Equation 6.61) we need 


(S: L) = (Sy)(Lx) + (Sy) (Ly) + (Sz)(Ly) = Hmm, [6.81] 


(note that (Sx) = (Sy) = (Lx) = (Ly) = 0 for eigenstates of S. and L;). Putting 
all this together (Problem 6.22), we conclude that 


El = [6.82] 


13.6 eV. » 3 E Id 4- 1) — mim; 
n? 4n Lid 21/204 D]1]- 


(The term in square brackets is indeterminate for / — 0; its correct value in this 
case is 1 — see Problem 6.24.) The total energy is the sum of the Zeeman part 
(Equation 6.79) and the fine structure contribution (Equation 6.82). 


Problem 6.22 Starting with Equation 6.80, and using Equations 6.57, 6.61, 6.64, 
and 6.81, derive Equation 6.82. 


x xProblem 6.23 Consider the (eight) n = 2 states, |2/ mj ms}. Find the energy of 


each state, under strong-field Zeeman splitting. Express each answer as the sum of 
three terms: the Bohr energy, the fine-structure (proportional to o?), and the Zeeman 
contribution (proportional to 44g Bexi). If you ignore fine structure altogether, how 
many distinct levels are there, and what are their degeneracies? 


Problem 6.24 If | — 0, then j = s, mj = ms, and the “good” states are the same 
([n m;)) for weak and strong fields. Determine E 5 (from Equation 6.72) and the 
fine structure energies (Equation 6.67), and write down the general result for the 
| — 0 Zeeman effect— regardless of the strength of the field. Show that the strong- 
field formula (Equation 6.82) reproduces this result, provided that we interpret the 
indeterminate term in square brackets as 1. 
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6.4.3 Intermediate-Field Zeeman Effect 


In the intermediate regime neither H; nor H;. dominates, and we must treat the 
two on an equal footing, as perturbations t to the Bohr Hamiltonian (Equation 6.42): 


I'll confine my attention here to the case n = 2, and use as the basis for degen- 
erate perturbation theory the states characterized by /, j, and m ue Using the 
Clebsch-Gordan coefficients (Problem 4.51 or Table 4.8) to express |j mj) as a 
linear combination of |/ m;)|s ms), we have: 


wi =15 4) -100)5 5). 


f=0 
y2 = |} 3D = 100)15 5) 
¥3 =13 5) — I1 0I 4) 
wa = 1$ x) = [1 - 1015 3) 
- vs = 139) -42/00153) + VTN 0$ 35. 


We =1$5) = -VIBU O $) + 2/1113. 
;21$3)241731-01523). + 42/3110) 54). 
Vs = 13 F) = —V273|1 -1)l3 3) + T/3110)5 3. 


In this basis the nonzero matrix elements of H;, are all on the diagonal, 
and given by Equation 6.66; H7, has four off-diagonal elements, and the complete 
matrix —W is (see Problem 6.25): 


j= 


sy-B 0 0 0 0 0 0 0 
0 Sy+B 0 0 0 0 0 0 
0 0 y-28 0 0 0 0 0 
0 0 0 y+2B 0 0 0 0 
2 2 
0 0 0 dE. X8 o0 0 
0 0 0 0 3B Sy-iB 0 0 
0 0 0 0 0 0 y+ip 2p 
0 0 0 0 0 0 2g  5y41B 


20 You ean use /, mj. ms states if you prefer—this makes the matrix elements of H; easier, but 
those of Hy. more difficult; the W matrix will be more complicated, but its eigenvalues (which are 
independent of basis) are the same either way. 
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where 
y = (@/8)713.6 eV and $= Ug Bex. 


The first four eigenvalues are already displayed along the diagonal; it remains only 


to find the eigenvalues of the two 2 x 2 blocks. The characteristic equation for the 


first of these is 
jal 
7 - A(6y — B) + (s? — Tyg) = 0, 


and the quadratic formula gives the eigenvalues: 


As = —3y + (8/2) E y 4y? + (2/3)y + (8/4). [6.84] 


The eigenvalues of the second block are the same, but with the sign of B reversed. 
The eight energies are listed in Table 6.2, and plotted against Bexı in Figure 6.12. 
In the zero-field limit (8 = 0) they reduce to the fine-structure values; for weak 
fields (8 « y) they reproduce what you got in Problem 6.21; for strong fields 
(B >> y) we recover the results of Problem 6.23 (note the convergence to five 
distinct energy levels, at very high fields, as predicted in Problem 6.23). 
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FIGURE 6.12: Zeeman splitting of the » = 2 states of hydrogen, in the weak, 
intermediate, and strong field regimes. 
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TABLE 6.2: Energy levels for the n = 2 states of 
hydrogen, with fine structure and Zeeman splitting. 


E;—5ytf 


& = Ej-5y-B 
€ = E,—-y+2B 
€ = Ex-y-2p 
és = Ej-3y« BD « Nay? + QI3yB + B24 


€, = Ej-3y« B2 - Nay? - QI3yB + B24 
= Ej-3y- BD + NAy? — QI3yB + F.A 
E, — 3y- BI2 — NAy? — (2/3) yB + B2/4 


Problem 6.25 Work out the matrix elements of H7 and H; 


[s and construct the 
W -matrix given in the text, for n = 2. 


x x xProblem 6.26 Analyze the Zeeman effect for the n = 3 states of hydrogen, in the 
weak, strong, and intermediate field regimes. Construct a table of energies (analo- 
gous to Table 6.2), plot them as functions of the external field (as in Figure 6.12), 
and check that the intermediate-field results reduce properly in the two limiting 
cases. 


6.5 HYPERFINE SPLITTING 


The proton itself constitutes a magnetic dipole, though its dipole moment is much 
smaller than the electron’s because of the mass in the denominator (Equation 6.60): 


§ pe e 
= ——S,, = ——§,. 6.85 
2M p Be DE me | ] 


p 
(The proton is a composite structure, made up of three quarks, and its gyromagnetic 
ratio is not as simple as the electron’s—hence the explicit g-factor (gp), whose 
measured value is 5.59, as opposed to 2.00 for the electron.) According to classical 
electrodynamics, a dipole j, sets up a magnetic field?! 


2 
HO uis FF - u] + H pE). [6.86] 


B= 
4nr? 3 


2l 1f you are unfamiliar with the delta function term in Equation 6.86. you can derive it by treating 
the dipole as a spinning charged spherical shell. in the limit as the radius goes to zero and the charge 
goes to infinity (with æ held constant). See D, J. Griffiths. Am. J. Phys.. 50. 698 (1982). 
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So the Hamiltonian of the electron, in the magnetic field due to the proton’s mag- 
netic dipole moment, is (Equation 6.58) 


Hogpe [3(Sp -?)(Se-?) — Sp - Se] ic Mog pe 
-3 


H}. = 
hf — 8 3 
TIM pMe ] m ple 


Sp: Se? (r). [6.87] 
According to perturbation theory, the first-order correction to the energy 
(Equation 6.9) is the expectation value of the perturbing Hamiltonian: 


jo Mog pe” (= XS, 5) - Sp Se 
h gam pe r? ) 

5 [6.88] 

HOS pe 

3npm, 


(Sp - Se)|¥r(0))?. 


In the ground state (or any other state for which / = 0) the wave function is spher- 
ically symmetrical, and the first expectation value vanishes (see Problem 6.27). 
Meanwhile, from Equation 4.80 we find that IW 100(0)1? = l/(za?), so 


5 
1 _ Ho0spe 


= ——_*__~(§, -S,). 6.89 
hf Juno p Se) d 


in the ground state. This is called spin-spin coupling, because it involves the dot 
product of two spins (contrast spin-orbit coupling, which involves S - L). 

In the presence of spin-spin coupling, the individual spin angular momenta 
are no longer conserved; the “good” states are eigenvectors of the total spin, 


S = Se + Sp. [6.90] 


As before, we square this out to get 
1 
Sp -Se = 5 ($^ — S; — Sp [6.91] 


But the electron and proton both have spin 1/2, so $2 = s = (3/4)h*. In the 
triplet state (spins “parallel”) the total spin is 1, and hence $? = 2/7; in the singlet 
state the total spin is 0, and S? = 0. Thus 


E! 4gph^ — [--1/4. (triplet): 


Ex LLL 6.92 
i 3mpmžc?a* | —3/4. (singlet). Vds! 


Spin-spin coupling breaks the spin degeneracy of the ground state, lifting the 
triplet configuration and depressing the singlet (see Figure 6.13). The energy gap 
is evidently 

4gyh* 
Ag 


= —— LL; = 5.88 x 10° ev. 6.93 
3m ,m?c?a* DES E [6.23] 
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FIGURE 6.13: Hyperfine splitting in the ground state of hydrogen. 


The frequency of the photon emitted in a transition from the triplet to the singlet 
state is AE 
dip is 1420 MHz. [6.94] 
1 


and the corresponding wavelength is c/v — 21 cm, which falls in the microwave 
region. This famous 21-centimeter line is among the most pervasive and ubiquitous 
forms of radiation in the universe. 


Problem 6.27 Let a and b be two constant vectors. Show that 
x nem 4 
fe - F)(b - F) sin dé dọ = a - b) [6.95] 


(the integration is over the usual range: 0 < 0 < x, 0 < $ < 2x). Use this result 
to demonstrate that 


r? 


[n FXSe 0) - Sp zi =f) 


for states with / = 0. Hint: F = sinf cos@i + sin sing} + cos Ok. 


Problem 6.28 By appropriate modification of the hydrogen formula, determine 
the hyperfine splitting in the ground state of (a) muonic hydrogen (in which a 
muon—same charge and g-factor as the electron, but 207 times the mass—substi- 
tutes for the electron), (b) positronium (in which a positron—same mass and 
g-factor as the electron, but opposite charge—substitutes for the proton), and 
(c) muonium (in which an anti-muon— same mass and g-factor as a muon, but 
opposite charge — substitutes for the proton). Hint: Don't forget to use the reduced 
mass (Problem 5.1) in calculating the “Bohr radius" of these exotic “atoms.” Inci- 
dentally, the answer you get for positronium (4.82 x 107* eV) is quite far from 
the experimental value (8.41 x 1074 eV); the large discrepancy is due to pair anni- 
hilation (e* + e^ — y + y), which contributes an extra (3/4) AE, and does not 
occur (of course) in ordinary hydrogen, muonic hydrogen, or muonium. 
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FURTHER PROBLEMS FOR CHAPTER 6 


Problem 6.29 Estimate the correction to the ground state energy of hydrogen due 
to the finite size of the nucleus. Treat the proton as a uniformly charged spherical 
shell of radius b, so the potential energy of an electron inside the shell is constant: 
—¢*/ (4m €ob); this isn't very realistic, but it is the simplest model, and it will 
give us the right order of magnitude. Expand your result in powers of tlie small 
parameter (b/a), where a is the Bohr radius, and keep only the leading term, so 
your final answer takes the form 

AE " 

E A(b/a)". 
Your business is to determine the constant A and the power n. Finally, put in 
b ~ 107? m (roughly the radius of the proton) and work out the actual number. 
How does it compare with fine structure and hyperfine structure? 


Problem 6.30 Consider the isotropic three-dimensional harmonic oscillator 
(Problem 4.38). Discuss the effect (in first order) of the perturbation 


H' = Axlyz 
(for some constant À) on 
(a) the ground state; 


(b) the (triply degenerate) first excited state. Hint: Use the answers to 
Problems 2.12 and 3.33. 


* * xProblem 6.31 Van der Waals interaction. Consider two atoms a distance R apart. 


Because they are electrically neutral you might suppose there would be no force 
between them, but if they are polarizable there is in fact a weak attraction. To 
model this system, picture each atom as an electron (mass i, charge —e) attached 
by a spring (spring constant k) to the nucleus (charge +e), as in Figure 6.14. We'll 
assume the nuclei are heavy, and essentially motionless. The Hamiltonian for the 
unperturbed system is 


l 2 1 2 | 2 l 2 
H9 2 —p pkr + — p? + lkxi. 6.96 
m”! * 3 E dA ts " l l 
+e XI. -e +e XI. -e 
X, X» 
EE soma 


FIGURE 6.14: Two nearby polarizable atoms (Problem 6.31). 
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The Coulomb interaction between the atoms is 


poc (5 á a [6.97] 
~ 4ra AVR R-x| Rcx R-—x(ctx»J. 


(a) Explain Equation 6.97. Assuming that |xı| and |x| are both much less than 


R, show that i 
sl EXN 
zx I. 6.98 
2m eg R? [ l 
(b) Show that the total Hamiltonian (Equation 6.96 plus Equation 6.98) separates 
into two harmonic oscillator Hamiltonians: 
Hush TER A TS i| ptu ee s [6.99] 
-|2n?*72 2: «9 R? i 2m^- 2 27 eg R? "Ee S 
under the change of variables 
bt c x3). which entails p L + p) [6.100] 
X+ = HY + NX), Ich en + = —=\Pi p. A 
£ J A 
(c) The ground state energy for this Hamiltonian is evidently 
1 k = (e?/2x eg R?) 
E= ^ (e +w_), where w4 = Ll. [6.101] 
1 
Without thé Coulomb interaction it would have been Eg = fiwo, where wo = 
/k/m. Assuming that k >> (e?/27r eg R?), show that 
2 
h e 1 
AV = E — Eo = —— | — ] —. 6.102 
i Bmw (<=) R6 | 
Conclusion: There is an attractive potential between the atoms, proportional 
to the inverse sixth power of their separation. This is the van der Waals 
interaction between two neutral atoms. 
(d) Now do the same calculation using second-order perturbation theory. Hint: 


The unperturbed states are of the form vy, G1) Vi, (x2), where y,a(x) is a 
one-particle oscillator wave function with mass m and spring constant k; AV 
is the second-order correction to the ground state energy, for the perturbation 
in Equation 6.98 (notice that the first-order correction is zero). 


x xProblem 6.32 Suppose the Hamiltonian H, for a particular quantum system, is 
a function of some parameter A; let E,(A) and v,(A) be the eigenvalues and 
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eigenfunctions of H(A). The Feynman-Hellmann theorem? states that 


OE 0H . 
78 = (w Ein) [6.103] 


(assuming either that E, is nondegenerate, or—if degenerate—that the w,,’s are 
the “good” linear combinations of the degenerate eigenfunctions). 


(a) Prove the Feynman-Hellmann theorem. Hint: Use Equation 6.9. 


(b) Apply it to the one-dimensional harmonic oscillator, (i) using A = w (this 
yields a formula for the expectation value of V), (ii) using A = /i (this yields 
(T)), and (iii) using A = m (this yields a relation between (T) and (V)). 
Compare your answers to Problem 2.12, and the virial theorem predictions 
(Problem 3.31). 


x xProblem 6.33 The Feynman-Hellmann theorem (Problem 6.32) can be used to 
determine the expectation values of 1/r and 1/r? for hydrogen.?? The effective 
Hamiltonian for the radial wave functions is (Equation 4.53) 


n? d? th e? 1 


2m dr2 2m r? 4neor. 


and the eigenvalues (expressed in terms of /)?^ are (Equation 4.70) 
me* 


E me E 
"327262 A? Cjimax +1 + 1)? 


(a) Use A = e in the Feynman-Hellmann theorem to obtain (1/r). Check your 
result against Equation 6.55. 


(b) Use 4 =/ to obtain (1/r?). Check your answer with Equation 6.56. 


x x xProblem 6.34 Prove Kramers’ relation:?? 


- x | 65) — Qs + Datei!) + nO +1)? - $2382 (752) =0, [6.104] 


2? Feynman obtained Equation 6.103 while working on his undergraduate thesis at MIT 
(R. P. Feynman, Phys. Rev. 56. 340. 1939); Hellmann's work was published four years earlier in an 
obscure Russian journal. 


?3C. Sánchez del Rio. Am. J. Phys. 50, 556 (1982): H. S. Valk, Am. J. Phys. 54, 921 (1986). 

?4]m part (b) we treat / as a continuous variable; n becomes a function of /, according to 
Equation 4.67, because jyjax. which must be an integer. is fixed. To avoid confusion, I have eliminated 
n, to reveal the dependence on / explicitly. 

25This is also known as the (second) Pasternack relation. Sce H. Beker, Am. J. Phys. 65. 1118 


(1997). For a proof based on the Feynman-Hellmann theorem (Problem 6.32). see S. Balasubramanian, 
Am. J. Phys. 68, 959 (2000). 
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which relates the expectation values of r to three different powers (s, s — 1, and 
s — 2), for an electron in the state Wyn, of hydrogen. Hint: Rewrite the radial 
equation (Equation 4.53) in the form 


" [I 2 l | 
u = u, 


r2 ar n-a~ 


and use it to express f (ur?u")dr in terms of (r*), (7571), and (r372). Then use 
integration by parts to reduce the second derivative. Show that f(ur*u')dr = 


—(s/2)(r57!), and f'(wr*u')dr = —[2/(s + 1] f u"r**lu') dr. Take it from 
there. 
Problem 6.35 


(a) Plugs = 0, s = 1, s = 2, and s = 3 into Kramers’ relation (Equation 6.104) 
to obtain formulas for (r^!), (r), (r?), and (r?). Note that you could continue 
indefinitely, to find any positive power. 


(b) In the orher direction, however, you hit a snag. Put in s — —1, and show 
that all you get is a relation between (r^?) and (r^). 


(c) But if you can get (r7?) by some other means, you can apply the Kramers’ 
relation to obtain the rest of the negative powers. Use Equation 6.56 (which is 
derived in Problem 6.33) to determine (r^?), and check your answer against 
Equation 6.64. 


x x «Problem 6.36 When an atom is placed in a uniform external electric field Esx, 
the energy levels are shifted—a phenomenon known as the Stark effect (it is the 
electrical analog to the Zeeman effect). In this problem we analyze the Stark effect 
for the n = ] and n = 2 states of hydrogen. Let the field point in the z direction, 
so the potential energy of the electron is 


H; = eEexz = eEcyr cos6. 


Treat this as a perturbation on the Bohr Hamiltonian (Equation 6.42). (Spin is 
irrelevant to this problem, so ignore it, and neglect the fine structure.) 


(a) Show that the ground state energy is not affected by this perturbation, in 
first order. 


(b) The first excited state is 4-fold degenerate: W299, W211, W210, V»j-1. Using 
degenerate perturbation theory, determine the first-order corrections to the 
energy. Into how many levels does E» split? 


(c) What are the "good" wave functions for part (b)? Find the expectation value 
of the electric dipole moment (p, = —er) in each of these "good" states. 


290 Chapter 6 Time-Independent Perturbation Theory 


Notice that the results are independent of the applied field—evidently hydro- 
gen in its first excited state can carry a permanent electric dipole momert. 


Hint: There are a lot of integrals in this problem, but almost all of them are 
zero. So study each one carefully, before you do any calculations: If the $ integral 
vanishes, there's not much point in doing the r and 0 integrals! Partial answer: 
Wi3 = W3; = —3ea Ec; all other elements are zero. 


x x «Problem 6.37 Consider the Stark effect (Problem 6.36) for the n = 3 states of 
hydrogen. There are initially nine degenerate states, W3/m (neglecting spin, as 
before), and we turn on an electric field in the z direction. 


(a) Construct the 9 x 9 matrix representing the perturbing Hamiltonian. Partial 
answer: (300|z|310) = —34/6a, (310|z|320) = —34/3a, (31 1|z]32 1) = 
—(9/2)a. 


(b) Find the eigenvalues, and their degeneracies. 


Problem 6.38 Calculate the wavelength, in centimeters, of the photon emitted 
under a hyperfine transition in the ground state (n — 1) of deuterium. Deuterium 
is "heavy" hydrogen, with an extra neutron in the nucleus; the proton and neutron 
bind together to form a deuteron, with spin 1 and magnetic moment 


2d 


= ——SqQ; 
2mqgq ‘ 


Ha 


the deuteron g-factor is 1.71. 


x * xProblem 6.39 In a crystal, the electric field of neighboring ions perturbs the energy 
levels of an atom. As a crude model, imagine that a hydrogen atom is surrounded 
by three pairs of point charges, as shown in Figure 6.15. (Spin is irrelevant to this 
problem, so ignore it.) 


(a) Assuming that r < dj, r « d», and r « ds, show that 
H' = Vo 38i? + Boy” + B32”) — (Bi + B2 + Br? 


where 
€ di 


E 2. and Vo —2(fid? [2 I2). 
Arey d nd Vo (Bid; + Bad5 + B3d3) 


Bi = 


(b) Find the lowest-order correction to the ground state energy. 


(c) Calculate the first-order corrections to the energy of the first excited states 
(n = 2). Into how many levels does this four-fold degenerate system split, 
(i) in the case of cubic symmetry, 6; = f» = fs; (ii) in the case of 
tetragonal symmetry, 8| = 2 x: £3; (ili) in the general case of orthorhom- 
bic symmetry (all three different)? 
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FIGURE 6.15: Hydrogen atom surrounded by six point charges (crude model for a 
crystal lattice); Problem 6.39. 


x x «Problem 6.40 Sometimes it is possible to solve Equation 6.10 directly, without 
having to expand US in terms of the unperturbed wave functions (Equation 6.11). 
Here are two particularly nice examples. 


(a) Stark effect in the ground state of hydrogen. 
(i) Find the first-order correction to the ground state of hydrogen in the 
presence of a uniform external electric field Eexı (the Stark effect — see 
Problem 6.36). Hint: Try a solution of the form 


(A+ Br+Cr*)e~"/4 cos; 


your problem is to find the constants A, B, and C that solve 
Equation 6.10. 

(i) Use Equation 6.14 to determine the second-order correction to the 
ground state energy (the first-order correction is zero, as you found in 
Problem 6.36(a)). Answer: ~—m(3a7eE ex, /2h)?. 


(b) If the proton had an electric dipole moment p, the potential energy of the 
electron in hydrogen would be perturbed in the amount 


H= epcos@ 


4reor?' 


(i) Solve Equation 6.10 for the first-order correction to the ground state 
wave function. 
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(ii) Show that the total electric dipole moment of the atom is (surprisingly) 
zero, to this order. 

(iti) Use Equation 6.14 to determine the second-order correction to the 
ground state energy. What is the first-order correction? 


CHAPTER 7 


THE VARIATIONAL PRINCIPLE 


7.1 THEORY 


Suppose you want to calculate the ground state energy, Egs, for a system described 
by the Hamiltonian H, but you are unable to solve the (time-independent) 
Schrédinger equation. The variational principle will get you an upper bound for 
Egs, which is sometimes all you need, and often, if you're clever about it, very 
close to the exact value. Here's how it works: Pick any normalized function w 
whatsoever; claim that 


Eg, < (V|H|y) = (H). [7.1] 


That is, the expectation value of H, in the (presumably incorrect) state w is certain 
to overestimate the ground state energy. Of course, if y just happens to be one 
of the excited states, then obviously (H) exceeds Eps; the point is that the same 
holds for any y whatsoever. 

Proof: Since the (unknown) eigenfunctions of H form a complete set, we can 
express y as a linear combination of them:! 


y = > cnn. with Hy, = EnWn. 
n 


Hf thé Hamiltonian admits scattering states. as well as bound states. then we'll need an integral 
as well as a sum. but the argument is unchanged. 
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Since wy is normalized, 


1= (Viv) -[x Cm Vm 


m 


5 
2 cite) = y» 2 Gen (nV) =>: len|”. 
n 


m 


(assuming the eigenfunctions themselves have been orthonormalized: (Vl) = 
mn). Meanwhile, 


e- [X Cn V, | H 


m 


2 Cn tx) = > 2 C Enen (Vm Vs) 2y Enlen l^ 
n 


n 


< En, 


But the ground state energy is, by definition, the smallest eigenvalue, so Egs 
and hence 


(H) = Egs »- iat = Es. 
n 


which is what we were trying to prove. 


Example 7.1 Suppose we want to find the ground state energy for the one- 
dimensional harmonic oscillator: 


Of course, we already know the exact answer, in this case (Equation 2.61): Egs = 
(1/2)fw; but this makes it a good test of the method. We might pick as our “trial” 
wave function the Gaussian, 


w(x) = Ae, [7.2] 


where b is a constant, and A is determined by normalization: 


1/4 
iuf e? dy = [AP Z =a =(2) [7.3] 


Now 


where, in this case, 


h? ER 2d 2 hb 
(T) = -Zar f e™ - (e^ ) dx = ie [7.5] 
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and 
l is 3 10? 
(V) = jme" |Al’ im e P y? dx = ES 
sO n 2 
^b mo 
H) = — + —. 7.6 
vu 2m F 8b [7-6] 


According to Equation 7.1, this exceeds Egs for any b: to get the tightest 
bound, let's minimize (Hy: 


d uH h? mo? | mao 
— — se — ; p = 
db 2m | 8p? 2h 
Putting this back into (H), we find 
l 
(A) min = 5ho. [7.7] 


In this case we hit the ground state energy right on the nose— because (obviously) 
I “just happened" to pick a trial function with precisely the form of the actual 
ground state (Equation 2.59). But the gaussian is very easy to work with, so it's a 
popular trial function, even when it bears little resemblance to the true ground state. 


Example 7.2 Suppose we're looking for the ground state energy of the delta- 
function potential: 


j h^ d? ia 
= —— — -— aó(x). 
2m dx? 
Again, we already know the exact answer (Equation 2.129): Egs = —ma?/2h*. As 


before, we'll use a gaussian trial function (Equation 7.2). We've already determined 
the normalization, and calculated (7); all we need is 


29 2 2b 
(V) = -ala f p Stx) dx = -a |. 
25 m 
Evidently 
h? 2l 
oss ES [7.8] 
2m T 


and we know that this exceeds Eg, for all b. Minimizing it, 


a h? a o n 2in2a? 
db 


= =0 > b= ; 
2m Narb zh 
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So 
? 
mas 


(H) min = NO. 


[7 


which is indeed somewhat higher than Eys. since m > 2. 


I said you can use any (normalized) trial function y whatsoever, and this 
true in a sense. However, for discontinuous functions it takes some fancy footwc 
to assign a sensible meaning to the second derivative (which you need, in order 
calculate (7)). Continuous functions with kinks in them are fair game, howev 
as long as you are careful; the next example shows how to handle them.” 


Example 7.3 Find an upper bound on the ground state energy of the on 
dimensional infinite square well (Equation 2.19), using the “triangular” trial wa 
function (Figure 7.1): 


Ax, if 0 < x < a/2, 
y(x)={ Ala- x). ifa/2<x <a. [7.1 
0. otherwise, 


where A is determined by normalization: 


a/2 a 3 2 [3 
1 = |A? f xdx | (a—x} dx =P => A-2C-J-. [WE 
J0 


a/2 


w(x) 


al2 a x 


FIGURE 7.1: Triangular trial wave function for the infinite square well (Equa 
tion 7.10). 


?For a collection of interesting examples.see W. N. Mei, Int. J. Educ. Sci. Tech. 30. 513 (1999) 


3There is no point in trying a function (such as the gaussian) that "leaks" outside the well 
because you'll get (V) = œ. and Equation 7.1 tells you nothing. 
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FIGURE 7.2: Derivative of the wave function in Figure 7.1. 


In this case 


A. if 0 <x <a/2 
1 , 
A —A. ifa/2«x«a, 
dx 0, otherwise, 


as indicated in Figure 7.2. Now, the derivative of a step function is a delta function 
(see Problem 2.24(b)): 


a = Aó(x) — 2Aô (x — a/2) + Aó(x — a), [7.12] 
and hence 
h?A 
(H) = ES few — 26(x —a/2) + 6(x — a)] w(x) dx 
h*A h?A?a 12h? 
= -5z [V (0) - 24 (a/2) + V ()) = ——;  U13 
m am 2ma- 


The exact ground state energy is Egs = m7h* /2ma? (Equation 2.27), so the theorem 
works (12 > 7). 


The variational principle is extraordinarily powerful, and embarrassingly casy 
to use. What a physical chemist does to find the ground state energy of some 
complicated molecule is write down a trial wave function with a large number of 
adjustable parameters, calculate (H). and tweak the parameters to get the lowest 
possible value. Even if v has little resemblance to the true wave function, you 
often get miraculously accurate values for Egs. Naturally, if you have some way 
of guessing a realistic V, so much the better. The only trouble with the method 
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is that you never know for sure how close you are to the target—all you can be 
certain of is that you've got an upper bound.* Moreover, as it stands, the technique 
applies only to the ground state (see, however, Problem 7.4).> 


xProblem 7.1 Use a gaussian trial function (Equation 7.2) to obtain the lowest 
upper bound you can on the ground state energy of (a) the linear potential: V(x) — 
a|x|; (b) the quartic potential: V (x) = ox*. 


x «Problem 7.2 Find the best bound on Egs for the one-dimensional harmonic oscil- 
lator using a trial wave function of the form 


A 


x24 p 


W(x) = 


where A is determined by normalization and þ is an adjustable parameter. 


Problem 7.3 Find the best bound on Egs for the delta-function potential V(x) = 
—aó(x), using a triangular trial function (Equation 7.10, only centered at the ori- 
gin). This time a is an adjustable parameter. 


Problem 7.4 


(a) Prove the following corollary to the variational principle: If (W|wWgs) = 0, 
then (H) > Efe, where Ere is the energy of the first excited state. 

Thus, if we can find a trial function that is orthogonal to the exact ground 
state, we can get an upper bound on the first excited state. In general, it’s 
difficult to be sure that V is orthogonal to Ygs, since (presumably) we don't 
know the latter. However, if the potential V(x) is an even function of x, 
then the ground state is likewise even, and hence any odd trial function will 
automatically meet the condition for the corollary. 


(b) Find the best bound on the first excited state of the one-dimensional harmonic 
oscillator using the trial function 


w(x) = Axe Pt, 


"In practice this isn't much of a limitation. and there are sometimes ways of estimating the 
accuracy. The ground state helium has been calculated to many significant digits in this way—see for 
example G. W. Drake et al., Phys. Rev. A 65. 054501 (2002) or Vladimir I. Korobov, Phys. Rev. A 66. 
024501 (2002). 

"For a systematic extension of the variational principle to the calculation of excited state ener- 


gics sec. for example. Linus Pauling and E. Bright Wilson, Introduction to Quantum Mechanics, With 
Applications to Chemistry. McGraw-Hill. New York (1935. paperback edition 1985). Section 26. 
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Problem 7.5 


(a) Use the variational principle to prove that first-order non-degenerate pertur- 
bation theory always overestimates (or at any rate never underestimates) the 
ground state energy. 


(b) In view of (a), you would expect that the second-order correction to the 
ground state is always negative. Confirm that this is indeed the case, by 
examining Equation 6.15. 


7.2 THE GROUND STATE OF HELIUM 


The helium atom (Figure 7.3) consists of two electrons in orbit around a nucleus 
containing two protons (also some neutrons, which are irrelevant to our purpose). 
The Hamiltonian for this system (ignoring fine structure and smaller corrections) is: 


2 2 
n=- vv- (i.i... [7.14] 
2m a 4meg \r) r -rl 

Our problem is to calculate the ground state energy, Egs. Physically, this repre- 
sents the amount of energy it would take to strip off both electrons. (Given Egs 
it is easy to figure out the “ionization energy” required to remove a single elec- 
tron—see Problem 7.6.) The ground state energy of helium has been measured to 
great precision in the laboratory: 


Egs = —78.975 eV (experimental). [7.15] 


This is the number we would like to reproduce theoretically. 
It is curious that such a simple and important problem has no known exact 
solution. The trouble comes from the electron-electron repulsion, 


9 
e* l 
= —_ —_—_.. 7.16 
"^. 4zo |r) — r?| dins. 


+2e FIGURE 7.3: The helium atom. 


ÓThere do exist exactly soluble three-body problems with many of the qualitative features of 
helium, but using non-coulombic potentials (see Problem 7.17). 
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If we ignore this term altogether, H splits into two independent hydrogen Hamil- 
tonians (only with a nuclear charge of 2e, instead of e); the exact solution is just 
the product of hydrogenic wave functions: 


8 wee 
Vo(ri. r2) = vioo(t1) Vioo(r2) = Ex eee [7.17] 
and the energy is 8E; = —109 eV (Equation 5.31).’ This is a long way from 


—79 eV, but it’s a start. 

To get a better approximation for Eps we'll apply the variational principle, 
using vo as the trial wave function. This is a particularly convenient choice because 
it’s an eigenfunction of most of the Hamiltonian: 


H yo = (8E| + Vee) Wo. [7.18] 
Thus 
(H) = 8E, + (Vee). [7.19] 
where® , : — 
2 N 8 z ya ry Mr2)/d 
(Vee) = ( d ) (5) | TEU dr, d?r. [7.20] 
4reo) \ra lr; — r?| 


I'll do the rz integral first; for this purpose rı is fixed, and we may as well orient 
the r2 coordinate system so that the polar axis lies along rı (see Figure 7.4). By 


the law of cosines, 
Ir) — r2| = Jn + ry — 2r,ra cos 62. [7.21] 


and hence 


e ^ a/a ela 
h= n= | 
Iri =- rj — p p + ry — 2r|ra COS 05 


The 6$» integral is trivial (277); the 05 integral is 


f sin 6 yr + i — 2rjr2 cos 05 
0 


3 sin 6» drad64dé». [7.22] 


bs 


d05 = 
Fir 


"nu: 3 — 2rir3 cos 6» 0 


. Here a is the ordinary Bohr radius and E, = -13.6/n? eV is the nth Bohr energy: recall 
that for a nucleus with atomic number Z. E, — ZE, and a — a/Z (Problem 4.16). The spin 
configuration associated with Equation 7.17 will be antisymmetric (the singlet). 


8 You can. if you like, interpret Equation 7.20 as first-order perturbation theory. with H’ = Vee. 
However. 1 regard this as a misuse of the method, since the perturbation is comparable in size to the 
unperturbed potential. I prefer, therefore, to think of it as a variational calculation. in which we are 
looking for an upper bound on Egs. 
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X5 


FIGURE 7.4: Choice of coordinates for the r2-integral (Equation 7.20). 


1 
= —— “re +724 2rin — r? +r? —2rire 
rir) - ^ 


1 
Ee (ye) irse | 
r\F2 


2/r|. ifn <r], 


2 
2/r2, ifn >r [7.23] 


Thus 


ri 
3 " 

ze Í i ( k 2) ein J | (7.24] 
T a 


It follows that (Ve) is equal to 


2 8 
(=) (5) J [i ES (1 d z) eme e ^^ lk sin, dridóidéy. 
47t€0 ma a 


The angular integrals are easy (47r), and the rı integral becomes 


co 2r? Sa? 
—4dría _ |. San —8r/a diim. 
i Ir (: + F Je | Ir 128 


Finally, then, 


5/ e 5 
Vee) = — =—-—E, = 34 eV, 7.25 
(Vee) 4a (=) 2 ! : | | 
and therefore 
(H) = —109 eV + 34 eV = —75 eV. [7.26] 


Not bad (remember, the experimental value is —79 eV). But we can do better. 
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We need to think up a more realistic trial function than yo (which treats the 
two electrons as though they did not interact at all). Rather than completely ignoring 
the influence of the other electron, let us say that, on the average, each electron 
represents a cloud of negative charge which partially shields the nucleus, so that 
the other electron actually sees an effective nuclear charge (Z) that is somewhat 
less than 2. This suggests that we use a trial function of the form 


z? , 
Vi(ri.r2) = aoe R [7.27] 


We'll treat Z as a variational parameter, picking the value that minimizes H. (Please 
note that in the variational method we never touch the Hamiltonian itself—the 
Hamiltonian for helium is, and remains, Equation 7.14. But it’s fine to think about 
approximating the Hamiltonian as a way of motivating the choice of the trial wave 
function. 

This wave function is an eigenstate of the “unperturbed” Hamiltonian (neglect- 
ing electron repulsion), only with Z, instead of 2, in the Coulomb terms. With this 
in mind, we rewrite H (Equation 7.14) as follows: 


h? e /Z Z 
H = -— (V? + V3) - — (5^ =) 
2m rr 


47r Eg 
2 
e Z—2 Z—2 1 
(: T + z-u) [7.28] 
4r Eo F] r2 Iri — T2| 
The expectation value of H is evidently 

e? 1 
(H) 2 2Z^ E, +22 -2)/ ))s (Vee). [7.29] 

4z€9/ Ar 


Here (1/r) is the expectation value of 1/r in the (one-particle) hydrogenic ground 
state Wioo (but with nuclear charge Z); according to Equation 6.55, 


H E a [7.30] 


The expectation value of V,, is the same as before (Equation 7.25), except that 
instead of Z = 2 we now want arbitrary Z—so we multiply a by 2/Z: 


SZ/í e 5Z 
Væ) = — | —— | = —-—E}. 7.31 
(Vee) 8a (=) 4 


Putting all this together, we find 


(H) = bz zy G/oZ | E, -[-222)- QT/AZ]E,. — [7.32] 
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According to the variational principle, this quantity exceeds E,, for any value 
of Z. The lowest upper bound occurs when (#7) is minimized: 


d 
77 (H) = [-4Z + QT/4)E, = 0. 


from which it follows that 21 
Z = — = 1.69. 7.33 
T: [7.33] 
This seems reasonable; it tells us that the other electron partially screens the nucleus, 
reducing its effective charge from 2 down to about 1.69. Putting in this value for 
Z, we find 


1 /3\° 
(H)—--lz) E = -77.5 eV. [7.34] 
2 \2 
The ground state of helium has been calculated with great precision in this 
way, using increasingly complicated trial wave functions, with more and more 
adjustable parameters.? But we're within 2% of the correct answer, and, frankly, 
at this point my own interest in the problem begins to wane.!9 


Problem 7.6 Using Egs = —79.0 eV for the ground state energy of helium, cal- 
culate the ionization energy (the energy required to remove just one electron). 
Hint: First calculate the ground state energy of the helium ion, He*, with a single 
electron orbiting the nucleus; then subtract the two energies. 


xProblem 7.7 Apply the techniques of this Section to the H^ and Li* ions (each 
has two electrons, like helium, but nuclear charges Z — 1 and Z — 3, respectively). 
Find the effective (partially shielded) nuclear charge, and determine the best upper 
bound on Egs, for each case. Comment: In the case of H^ you should find that 
(H) > —13.6 eV, which would appear to indicate that there is no bound state at 
all, since it would be energetically favorable for one electron to fly off, leaving 
behind a neutral hydrogen atom. This is not entirely surprising, since the electrons 
are less strongly attracted to the nucleus than they are in helium, and the electron 
repulsion tends to break the atom apart. However, it turns out to be incorrect. With 
a more sophisticated trial wave function (see Problem 7.18) it can be shown that 
Egs < —13.6 eV. and hence that a bound state does exist. It's only barely bound, 


?The classic studies are E. A. Hylleraas, Z. Phys. 65. 209 (1930); C. L. Pekeris, Phys. Rev. 115, 
1216 (1959). For more recent work see footnote 4. 


10The first excited state of helium can be calculated in much the same way. using a trial wave 
function orthogonal to the ground state. See P. J. E. Peebles, Quantum Mechanics, Princeton U.P., 
Princeton. NJ (1992). Section 40. 
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however, and there are no excited bound states,!! so H7 has no discrete spectrum 
(all transitions are to and from the continuum). As a result, it is difficult to study 
in the laboratory, although it exists in great abundance on the surface of the sun.!? 


7.3 THE HYDROGEN MOLECULE ION 


Another classic application of the variational principle is to the hydrogen molecule 
ion, HF, consisting of a single electron in the Coulomb field of two protons 
(Figure 7.5). I shall assume for the moment that the protons are fixed in posi- 
tion, a specified distance R apart, although one of the most interesting byproducts 
of the calculation is going to be the actual value of R. The Hamiltonian is 


n? 3 e? l l 
H = -— V^ — < e [7.35] 
2m 4meg Nr|. ra 


where rı and rz are the distances to the electron from the respective protons. As 
always, our strategy will be to guess a reasonable trial wave function, and invoke 
the variational principle to get a bound on the ground state energy. (Actually, our 
main interest is in finding out whether this system bonds at all —that is, whether 
its energy is less than that of a neutral hydrogen atom plus a free proton. If our 
trial wave function indicates that there is a bound state, a better trial function can 
only make the bonding even stronger.) 

To construct the trial wave function, imagine that the ion is formed by taking 
a hydrogen atom in its ground state (Equation 4.80), 


Yor) = gm [7.36] 


za? 


bringing the second proton in from "infinity," and nailing it down a distance R 
away. If R is substantially greater than the Bohr radius, the electron's wave function 
probably isn't changed very much. But we would like to treat the two protons on 


—e 


+e R +e FIGURE 7.5: The hydrogen molecule ion, He, 


' Robert N. Hill, J. Math. Phys. 18, 2316 (1977). 


I?For further discussion see Hans A. Bethe and Edwin E. Salpeter, Quantum Mechanics of One- 
and Two-Electron Atoms, Plenum. New York (1977). Section 34, 


Section 7.3: The Hydrogen Molecule lon 305 


an equal footing, so that the electron has the same probability of being comin 
with either one. This suggests that we consider a trial function of the form 


V = A[Vo(ri) + Vo(r2)]. 


(Quantum chemists call this the LCAO technique, because we are expressi 
molecular wave function as a linear combination of atomic orbitals.) 
Our first task is to normalize the trial function: 


i= [wer iat |f loro d?r 
+ J hedra? | Vom dr], (7: 


The first two integrals are | (since Wo itself is normalized); the third is more tricky. 
Let 


l ties 
I = (Wo(r1)|Wo(r2)) = a f enon dîr. [7.39] 


Picking coordinates so that proton | is at the origin and proton 2 is on the z axis 
at the point R (Figure 7.6), we have 


ri—r and m= Vyr? + R? — 2rRcos6. [7.40] 
and therefore 


p =l f erra Pear costa 2 ing drdédo. [7.41] 


= Xr? 4 R? - 2r H cos 0 


X 


FIGURE 7.6: Coordinates for the calculation of I (Equation 7.39). 
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The @ integral is trivial (277). To do the 0 integral, let 
y =Vr2+R2—2rRcos6. so that d(y*) = 2y dy = 2rRsin6 dé. 


Then 


* UP ERRORS ] rte 
| g VI ER = irk coha sing qg — xl e M y dy 
0 rR Jy-RI 
= E- D. +R+a)— eV RV (| — R| + a)| i 
: 


The r integral is now straightforward: 


2 i ! 
l= [-«-^^ f (r+ R 4 a)e I, dr + e7 Fla | (R—r+a)rdr 
a-R Jo 0 


x 
+ ef/a | (r— Ra a)e |a, dr. 
R 


Evaluating the integrals, we find (after some algebraic simplification), 


I = g-RIa | + (=) ats (2) | [7.42] 
a 3 \a 


I is called an overlap integral; it measures the amount by which vo(ri) overlaps 
V'o(r2) (notice that it goes to 1 as R — O, and to 0 as R — co). In terms of Z, the 
normalization factor (Equation 7.38) is 


4 l 
|A| = =. [7.43] 
2(1+ I) 
Next we must calculate the expectation value of H in the trial state i. Not- 
ing that 
m e] 
———V? — ——— | volri) = Ero(ri) 

2m 47t eor] 

(where E; = —13.6 eV is the ground state energy of atomic hydrogen)—and the 


same with r2 in place of r; —we have 


h^ s r ee | 
Hy =A E m (- F 3] [Vo(ri) + Vo(rz)) 
2m 4z€9 \ri r2 


2 
e 


l l 
= E\w —A ( ) E T 2) . 


47t € 
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It follows that 


a 


(H)= E2142 ( : ) [iom 


47r ey 


] 


r2? 


1 
Vo(Gri)) +(vo(r1) - 


Vo xy . [7.44] 


I'll let you calculate the two remaining quantities, the so-called direct integral, 


] 
D=a(Wo(ri) = Yo(r1)). [7.45] 
and the exchange integral, 
1 
X =a(yo(ri) P Wo(r2)). [7.46] 
The results (see Problem 7.8) are 
mier aN 2R/a 
D-- (1 y z)* [7.47] 
and 
R —R/a 
X={1l+—Je f [7.48] 
a 


Putting all this together, and recalling (Equations 4.70 and 4.72) that Ej = 
— (e? JAz eg) (1/2a), we conclude: 


[7.49] 


(H) = |o c2 E). 


(14 I) 


According to the variational principle, the ground state energy is less than (H). Of 
course, this is only the electron's energy —there is also potential energy associated 
with the proton-proton repulsion: 


——E). [7.50] 


Thus the total energy of the system, in units of — Ej, and expressed as a function 
of x z R/a, is less than 


(1 — (2/3)x")e™* + (14+ x)e™~ l. [7.51] 


2 
PASSER | FOFO e 


This function is plotted in Figure 7.7. Evidently bonding does occur, for there 
exists a region in which the graph goes below —1, indicating that the energy is less 
than that of a neutral atom plus a free proton (—13.6 eV). It’s a covalent bond, with 
the electron shared equally by the two protons. The equilibrium separation of the 
protons is about 2.4 Bohr radii, or 1.3 A (the experimental value is 1.06 A). The 
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-0.5 


Equilibrium 


FIGURE 7.7: Plot of the function F(x), Equation 7.51, showing existence of a bound 
state (x is the distance between the protons, in units of the Bohr radius). 


calculated binding energy is 1.8 eV, whereas the experimental value is 2.8 eV (the 
variational principle, as always, overestimates the ground state energy —and hence 
underestimates the strength of the bond—but never mind: The essential point was 
to see whether binding occurs at all; a better variational function can only make 
the potential well even deeper. 


xProblem 7.8 Evaluate D and X (Equations 7.45 and 7.46). Check your answers 


against Equations 7.47 and 7.48. 


xxProblem 7.9 Suppose we used a minus sign in our trial wave function 


(Equation 7.37): 
y = A[Vo(ri) — Vo(r2)]. [7.52] 


Without doing any new integrals, find F(x) (the analog to Equation 7.51) for this 
case, and construct the graph. Show that there is no evidence of bonding.? (Since 
the variational principle only gives an upper bound, this doesn't prove that bonding 
cannot occur for such a state, but it certainly doesn't look promising). Comment: 
Actually, any function of the form 


y = A[Vo(ri) + e'?yo(r2)]. [7.53] 


'3Bonding occurs when the electron "prefers" to be between the protons, attracting them inward. 
But the odd linear combination (Equation 7.52) has a node at the center, so it's not surprising that this 
configuration drives the protons apart. 
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has the desired property that the electron is equally likely to be associated with 
either proton. However, since the Hamiltonian (Equation 7.35) is invariant under 
the interchange P: r} <> ro, its eigenfunctions can be chosen to be simultaneously 
eigenfunctions of P. The plus sign (Equation 7.37) goes with the eigenvalue +1, 
and the minus sign (Equation 7.52) with the eigenvalue —1; nothing is to be gained 
by considering the ostensibly more general case (Equation 7.53), though you’re 
welcome to try it, if you’re interested. 


x x xProblem 7.10 The second derivative of F(x), at the equilibrium point, can be 
used to estimate the natural frequency of vibration (w) of the two protons in the 
hydrogen molecule ion (see Section 2.3). If the ground state energy (Aw/2) of this 
oscillator exceeds the binding energy of the system, it will fly apart. Show that in 
fact the oscillator energy 1s small enough that this will not happen, and estimate 
how many bound vibrational levels there are. Note: You're not going to be able 
to obtain the position of the minimum-—still less the second derivative at that 
point—analytically. Do it numerically, on a computer. 


FURTHER PROBLEMS FOR CHAPTER 7 


Problem 7.11 


(a) Use a trial wave function of the form 


„~ _ | Acos(zx/a), if (—a/2 <x < a/2), 
yo = | 0 otherwise 


to obtain a bound on the ground state energy of the one-dimensional harmonic 
oscillator. What is the “best” value of a? Compare (H)min with the exact 
energy. Note: This trial function has a “kink” in it (a discontinuous derivative) 
at +a/2; do you need to take account of this, as I did in Example 7.3? 


(b) Use v (x) = Bsin(zx/a) on the interval (—a, a) to obtain a bound on the 
first excited state. Compare the exact answer. 


x *Problem 7.12 


(a) Generalize Problem 7.2, using the trial wave function!4 


(Ose = 
moe (x? d: b2)n x 


HW, N. Mei, Int. J. Educ. Sci. Tech. 27, 285 (1996). 
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for arbitrary n. Partial answer: The best value of b is given by 


po [nene 


= mo 2(2n + 1) 


(b) Find the least upper bound on the first excited state of the harmonic oscillator 
using a trial function of the form 


Bx 


V (x) = (x2 + b2)" ` 


Partial answer: The best value of b is given by 


— mo 2n 4- 1) 


po^ [ne enm 


(c) Notice that the bounds approach the exact energies as n — oo. Why is 
that? Hint: Plot the trial wave functions for n — 2, n — 3, and n — 4, and 
compare them with the true wave functions (Equations 2.59 and 2.62). To do 
it analytically, start with the identity 


e = lim (1+ 3E 


hoo 


Problem 7.13 Find the lowest bound on the ground state of hydrogen you can get 
using a gaussian trial wave function 


V(r) = Ae", 


where A is determined by normalization and b is an adjustable parameter. Answer: 
—11.5 eV. 


x «Problem 7.14 If the photon had a nonzero mass (m, Æ 0), the Coulomb potential 
would be replaced by the Yukawa potential, 


2 e" 


V(r) = —- 


[7.54] 
4men r 

where u = myc/h. With a trial wave function of your own devising, estimate the 

binding energy of a “hydrogen” atom with this potential. Assume ua < 1, and 

give your answer correct to order (ua)?. 


Problem 7.15 Suppose you're given a quantum system whose Hamiltonian Ho 
admits just two eigenstates, Wa (with energy E,,), and Yp (with energy Ep). They 
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are orthogonal, normalized, and nondegenerate (assume E, is the smaller of the two 
energies). Now we turn on a perturbation H’, with the following matrix elements: 


(Val H’ Wa) = (Vp | H'| Vo) = 0: (WalH'|Wo) = (Wol H’ Ipa) =h. [7.55] 
where h is some specified constant. 


(a) Find the exact eigenvalues of the perturbed Hamiltonian. 


(b) Estimate the energies of the perturbed system using second-order perturbation 
theory. 


(c) Estimate the ground state energy of the perturbed system using the variational 
principle, with a trial function of the form 


V = (cos) Wa + (sing) Wn. [7.56] 


where @ is an adjustable parameter. Note: Writing the linear combination in 
this way is just a neat way to guarantee that y is normalized. 


(d) Compare your answers to (a), (b), and (c). Why is the variational principle 
So accurate, in this case? 


Problem 7.16 As an explicit example of the method developed in Problem 7.15, 
consider an electron at rest in a uniform magnetic field B = B-k, for which the 
Hamiltonian is (Equation 4.158): 

eB- 

Ho = — S.. [7.57] 

ni 
The eigenspinors, x, and xy. and the corresponding energies, E, and Ep, are given 
in Equation 4.161. Now we turn on a perturbation, in the form of a uniform field 
in the x direction: 


H'’=—S5,. [7.58] 
(a) Find the matrix elements of H’, and confirm that they have the structure of 
Equation 7.55. What is A? 


(b) Using your result in Problem 7.15(b), find the new ground state energy, in 
second-order perturbation theory. 


(c) Using your result in Problem 7.15(c), find the variational principle bound on 
the ground state energy. 


x x «Problem 7.17 Although the Schrödinger equation for helium itself cannot be 
solved exactly, there exist "helium-like" systems that do admit exact solutions. 
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A simple example!> is “rubber-band helium." in which the Coulomb forces are 
replaced by Hooke's law forces: 


h? 1 A 
H= = (V? + Vi) + Emo (r? + r2) — —mo"|r — r|?. [7.59] 
2m R 2 " 4 
(a) Show that the change of variables from rj, r2, to 


[7.60] 


Il 
1 
= 
“| 
NA 
— 


l 
u = — (ri +r2) v 
2 


V 


turns the Hamiltonian into two independent three-dimensional harmonic oscil- 
lators: 


n? za o1 79 h? > 1 22 
H = KC + FURE + So a + ; nd Anw" v" . [7.61] 


(b) What is the exact ground state energy for this system? 


(c) If we didn't know the exact solution, we might be inclined to apply the 
method of Section 7.2 to the Hamiltonian in its original form (Equation 7.59). 
Do so (but don't bother with shielding). How does your result compare with 
the exact answer? Answer: (H) = 3ho(1 — 4/4). 


x x xProblem 7.18 In Problem 7.7 we found that the trial wave function with shield- 
ing (Equation 7.27), which worked well for helium, is inadequate to confirm the 
existence of a bound state for the negative hydrogen ion. Chandrasekhar'® used a 
trial wave function of the form 


V (r1. r2) = A[Vi (ri Via (02) + Va! va 02]. [7.62] 


Zi Z; 5 
Wir) = ae and y(r) = -— Na. [7.63] 


In effect, he allowed two different shielding factors, suggesting that one electron is 
relatively close to the nucleus, and the other is farther out. (Because electrons are 
identical particles, the spatial wave function must be symmetrized with respect to 
interchange. The spin state— which is irrelevant to the calculation—is evidently 


where 


Eor a more sophisticated model. see R. Crandall. R. Whitnell. and R. Bettega, Am. J. Phys. 52, 
438 (1984). 


16S. Chandrasekhar. Astrophys. J. 100. 176 (1944). 
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antisymmetric.) Show that by astute choice of the adjustable parameters Z; and Z2 
you can get (H) less than —13.6 eV. Answer: 


Ei 8 Pod oor low dogs. dog 
Hae 6 (^ 4 2x TX y E y E y tg" — 5% : 


where x = Zi + Z2 and y = 2./Z; Z2. Chandrasekhar used Z; = 1.039 (since 
this is larger than 1, the motivating interpretation as an effective nuclear charge 
cannot be sustained, but never mind—it’s still an acceptable trial wave function) 
and Z2 = 0.283. 


Problem 7.19 The fundamental problem in harnessing nuclear fusion is getting 
the two particles (say, two deuterons) close enough together for the attractive (but 
short-range) nuclear force to overcome the Coulomb repulsion. The "bulldozer" 
method is to heat the particles up to fantastic temperatures, and allow the random 
collisions to bring them together. A more exotic proposal is muon catalysis, in 
which we construct a "hydrogen molecule ion," only with deuterons in place of 
protons, and a muon in place of the electron. Predict the equilibrium separation 
distance between the deuterons in such a structure, and explain why muons are 
superior to electrons for this purpose." 


x x xProblem 7.20 Quantum dots. Consider a particle constrained to move in two 
dimensions in the cross-shaped region shown in Figure 7.8. The "arms" of the 
cross continue out to infinity. The potential is zero within the cross, and infinite in 
the shaded areas outside. Surprisingly, this configuration admits a positive-energy 
bound state.!? 


(a) Show that the lowest energy that can propagate off to infinity is 


mh 


8na? 


Ethreshold = : 

any solution with energy less than that has to be a bound state. Hint: Go way 
out one arm (say, x >> a), and solve the Schrödinger equation by separation 
of variables; if the wave function propagates out to infinity, the dependence 
on x must take the form exp(ik;x) with ką > 0. 


\7 The classic paper on muon-catalyzed fusion is J. D. Jackson. Phys. Rev. 106, 330 (1957): for a 
more recent popular review, see J. Rafelski and S. Jones, Scientific American, November 1987, page 84. 


IS This model is taken from R. L. Schult et al., Phys. Rev. B 39, 5476 (1989). In the presence 
of quantum tunneling a classically bound state becomes unbound; this is the reverse: A classically 
unbound state is quantum mechanically bound. 
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FIGURE 7.8: The cross-shaped region for Problem 7.20. 


(b) Now use the variational principle to show that the ground state has energy less 
than Ethreshola. Use the following trial wave function (suggested by Krishna 
Rajagopal): 


(1—|xyl/a*)e*, | |x| £a and |y| <a 
(1— |x|/a)e" 95/2, |x| <a and |y| >a 
(1—|yl/a)e*^V^, |x| 2 a and |y| <a 
0, elsewhere. 


V(x.»)—4A 


Normalize it to determine A, and calculate the expectation value of H. 
Answer: 


3h? (a? -2a 4-3 
an - (EL). 


^ ma? 6 4- 11a 


Now minimize with respect to o, and show that the result is less than 
Eireshola. Hint: Take full advantage of the symmetry of the problem— you 
only need to integrate over 1/8 of the open region, since the other 7 inte- 
grals will be the same. Note however that whereas the trial wave function is 
continuous, its derivatives are not —there are “roof-lines” at x = 0, y = 0, 
x = ta, and y = ta, where you will need to exploit the technique of 
Example 7.3. 


CHAPTER 8 


THE WKB APPROXIMATION 


The WKB (Wentzel, Kramers, Brillouin)! method is a technique for obtaining 
approximate solutions to the time-independent Schródinger equation in one dimen- 
sion (the same basic idea can be applied to many other differential equations, and 
to the radial part of the Schrédinger equation in three dimensions). It is particu- 
larly useful in calculating bound state energies and tunneling rates through potential 
barriers. 

The essential idea is as follows: Imagine a particle of energy E moving through 
a region where the potential V(x) is constant. If E > V, the wave function is of 
the form 


W(x) = Ae}, with k & /2m(E — V)/h. 


The plus sign indicates that the particle is traveling to the right, and the minus sign 
means it is going to the left (the general solution, of course, is a linear combination 
of the two). The wave function is oscillatory, with fixed wavelength (A = 27x / k) and 
unchanging amplitude (A). Now suppose that V (x) is not constant, but varies rather 
slowly in comparison to A, so that over a region containing many full wavelengths 
the potential is essentially constant. Then it is reasonable to suppose that y remains 
practically sinusoidal, except that the wavelength and the amplitude change slowly 
with x. This is the inspiration behind the WKB approximation. In effect, it identifies 
two different levels of x-dependence: rapid oscillations, modulated by gradual 
variation in amplitude and wavelength. 


lIn Holland it's KWB. in France it’s BWK. and in England it's JWKB (for Jeffreys). 
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By the same token, if E < V (and V is constant), then y is exponential: 


v(x) = Aet". with « = /2m(V — E)/h. 


And if V(x) is not constant, but varies slowly in comparison with 1/x, the solu- 
tion remains practically exponential, except that A and x are now slowly-varying 
functions of x. 

Now, there is one place where this whole program is bound to fail, and that is 
in the immediate vicinity of a classical turning point, where E œ% V. For here A (or 
1/«) goes to infinity, and V(x) can hardly be said to vary “slowly” in comparison. 
As we shall see, a proper handling of the turning points is the most difficult aspect 
of the WKB approximation, though the final results are simple to state and easy to 
implement. 


8.1 THE “CLASSICAL” REGION 


The Schródinger equation, 


n? d^ 
—— V =E 
ma Oe y 
can be rewritten in the following way: 
dy Pp 
dS = gil [8.1] 
where 
p(x) = y2m[E — V(x)] [8.2] 


is the classical formula for the momentum of a particle with total energy E and 
potential energy V (x). For the moment, I'll assume that E > V (x), so that p(x) 
is real; we call this the “classical” region, for obvious reasons—classically the 
particle is confined to this range of x (see Figure 8.1). In general, y is some 
complex function; we can express it in terms of its amplitude, A(x). and its phase, 
$ (x) — both of which are real: 


y (x) = A(x) ei? [8.3] 


Using a prime to denote the derivative with respect to x, we find: 


d : 
dV _ a ti Ade’? 
dx 


and : 
d" / . " / i 
“5 = [A" + 2i A'9' + iAG" — AY e’. [8.4] 
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V(x) 


Turning points 


Classical region 
FIGURE 8.1: Classically, the particle is confined to the region where E > V(x). 
Putting this into Equation 8.1: 


p? 


A" £ 2i 4d! +i Ad" — AQ = — 


A. [8.5] 


This is equivalent to two real equations, one for the real part and one for the 
imaginary part: 


2 2 
A" — A'Y = eur, or A” =A lor = 5| ; [8.6] 
and ; 
2A'à! + Ab" =0. or C) =a), [8.7] 


Equations 8.6 and 8.7 are entirely equivalent to the original Schródinger 
equation. The second one is easily solved: 


C 
Ap =C. or A=, i 

$ Jd [8.8] 
where C is a (real) constant. The first one (Equation 8.6) cannot be solved in 
general — so here comes the approximation: We assume that the amplitude A varies 
slowly, so that the A" term is negligible. (More precisely, we assume that A"/A 
is much less than both ($^)? and p^[h?.) In that case we can drop the left side of 
Equation 8.6, and we are left with 
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and therefore 


l 
o(x) = t. | p(x) dx. [8.9] 


(Lll write this as an indefinite integral, for now—any constants can be absorbed 
into C, which may thereby become complex.) It follows that 


ot nt pdx [8.10] 


kd 


and the general (approximate) solution will be a linear combination of two such 
terms, one with each sign. 


Notice that ! 
C Pus 

I Gor a EF. 
p(x) 


[8.11] 


which says that the probability of finding the particle at point x is inversely pro- 
portional to its (classical) momentum (and hence its velocity) at that point. This is 
exactly what you would expect—the particle doesn't spend long in the places where 
it is moving rapidly, so the probability of getting caught there is small. In fact, the 
WKB approximation is sometimes derived by starting with this “semi-classical” 
observation, instead of by dropping the A" term in the differential equation. The 
latter approach is cleaner mathematically, but the former offers a more plausible 
physical rationale. 


Example 8.1 Potential well with two vertical walls. Suppose we have an infi- 
nite square well with a bumpy bottom (Figure 8.2): 


Vi | some specified function, if0 « x <a, [8.12] 


OO. otherwise. 


V(x) 


a x 


FIGURE 8.2: Infinite square well with a bumpy bottom. 
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Inside the well (assuming E > V(x) throughout) we have 


1 
V p(x) 


w(x) = [e 41009 + Caer PR : 


or, more conveniently, 


w(x) = —3 [Ci sin $ (x) + C2cos$ (x)]. [8.13] 


where (exploiting the freedom noted earlier to impose a convenient lower limit on 
the integral) 


1 X 
o(x) = = Í p(x’) dx. [8.14] 
h Jo 


Now y (x) must go to zero at x = 0, and therefore (since $ (0) = 0) C» = 0. Also, 
V (x) goes to zero at x — a, so 


ó(a) 2nz (n=1,2.3,...). [8.15] 
Conclusion: 


f p(x) dx =nnħ. [8.16] 
0 


This quantization condition determines the (approximate) allowed energies. 
For instance, if the well has a flat bottom (V(x) = 0), then p(x) = /2mE (a 
constant), and Equation 8.16 says pa = nah, or 


9.923 
n n h- 
2ma? 


which is the old formula for the energy levels of the infinite square well 
(Equation 2.27). In this case the WKB approximation yields the exact answer (the 
amplitude of the true wave function is constant, so dropping A” cost us nothing). 


xProblem 8.1 Use the WKB approximation to find the allowed energies (E,,) of 
an infinite square well with a “shelf,” of height Vo extending half-way across 
(Figure 6.3): 


Vo. 1fO<x <a/2. 
V(x)21(0, ifa/2<x* <a, 
oo, otherwise. 
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Express your answer in terms of Vo and E? = (nz h)? /2ma? (the nth allowed 
energy for the infinite square well with no shelf). Assume that E? > Vo, but do 
not assume that E, >> Vo. Compare your result with what we got in Example 6.1 
using first-order perturbation theory. Note that they are in agreement if either Vo is 
very small (the perturbation theory regime) or n is very large (the WKB— semi- 
classical — regime). 


xxProblem 8.2 An illuminating alternative derivation of the WKB formula 
(Equation 8.10) is based on an expansion in powers of A. Motivated by the free- 
particle wave function, V = A exp(tipx/h), we write 


w(x) = eif OR. 


where f(x) is some complex function. (Note that there is no loss of generality 
here—any nonzero function can be written in this way.) 


(a) Put this into Schrédinger’s equation (in the form of Equation 8.1), and show 
that 


inf" — q'Y +p =0. 
(b) Write f(x) as a power series in 7: 
f(x) = folx) - RÁiGO +A fa) s. 
and, collecting like powers of A, show that 
(fo =P. if =2foA. di-2ffutD. ec 


(c) Solve for fo(x) and f(x), and show that—to first order in H—you recover 
Equation 8.10. 


Note: The logarithm of a negative number is defined by In(—z) = In(z) + inr, 


where n is an odd integer. If this formula is new to you, try exponentiating both 
sides, and you'll see where it comes from. 


82 TUNNELING 


So far, I have assumed that E > V, so p(x) is real. But we can easily write down 
the corresponding result in the nonclassical region (E < V)—it’s the same as 
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before (Equation 8.10), only now p(x) is imaginary:? 


OC  QilfIpenldx. [8.17] 


Consider, for example, the problem of scattering from a rectangular barrier 
with a bumpy top (Figure 8.3). To the left of the barrier (x « 0), 


y(x) = Ae + Bei. [8.18] 


where A is the incident amplitude, B is the reflected amplitude, and k = /2mE/h 
(see Section 2.5). To the right of the barrier (x > a), 


y) = Fe: [8.19] 


F is the transmitted amplitude, and the transmission probability is 


|F|? 
T= AL" [8.20] 
In the tunneling region (0 < x < a), the WKB approximation gives 
U(x) m ueber 4 Pepe pede’ quaj 


VIpool IPODI 


FIGURE 8.3: Scattering from a rectangular barrier with a bumpy top. 


?In this case the wave function is real, and the analogs to Equations 8.6 and 8.7 do not follow 
necessarily from Equation 8.5, although they are still sufficient. If this bothers you, study the alternative 
derivation in Problem 8.2. 
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FIGURE 8.4: Qualitative structure of the wave function, for scattering from a high, 
broad barrier. 


If the barrier is very high and/or very wide (which is to say, if the probability 
of tunneling is small), then the coefficient of the exponentially increasing term (C) 
must be small (in fact, it would be zero if the barrier were infinitely broad), and 
the wave function looks something like? Figure 8.4. The relative amplitudes of the 
incident and transmitted waves are determined essentially by the total decrease of 
the exponential over the nonclassical region: 


La 
[Al 


2 Áo Apu dx! 


so that 


[8.22] 


Example 8.2 Gamow’s theory of alpha decay.^ In 1928, George Gamow (and, 
independently, Condon and Gurney) used Equation 8.22 to provide the first success- 
ful explanation of alpha decay (the spontaneous emission of an alpha-particle—two 
protons and two neutrons — by certain radioactive nuclei).? Since the alpha particle 
carries a positive charge (2e), it will be electrically repelled by the leftover nucleus 
(charge Ze), as soon as it gets far enough away to escape the nuclear binding 
force. But first it has to negotiate a potential barrier that was already known (in the 
case of uranium) to be more than twice the energy of the emitted alpha particle. 
Gamow approximated the potential energy by a finite square well (representing 
the attractive nuclear force), extending out to rı (the radius of the nucleus), joined 


This heuristic argument can be made more rigorous—see Problem 8.10. 

"For a more complete discussion. and alternative formulations. see Barry R. Holstein, 
Am. J. Phys. 64, 1061 (1996). 

>For an interesting brief history see Eugen Merzbacher, "The Early History of Quantum Tun- 
neling." Physics Today. August 2002. p. 44. 
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Coulomb repulsion 


Nuclear binding 


FIGURE 8.5: Gamow’s model for the potential energy of an alpha particle in a 
radioactive nucleus. 


to a repulsive coulombic tail (Figure 8.5), and identified the escape mechanism as 
quantum tunneling (this was, by the way, the first time that quantum mechanics 
had been applied to nuclear physics). 

If E is the energy of the emitted alpha particle, the outer turning point (72) is 
determined by 
| 22e*.. 
4meo n 


E. [8.23] 


The exponent y (Equation 8.22) is evidently? 


D f" 1 2ze V2mE (^ Jr 
v=] 2m ( i -E)ar= z Í J2 = ar. 
h JA 4ré9 r h n r 


The integral can be done by substitution (let r = r2 sin? x), and the result is 


,-YTR(I-s44)-/ne-mp. ^ c2 


h 2 


Typically, ri « r2, and we can simplify this result using the small angle approxi- 
mation (sin e = e): 


~ VàmE [7t Z 
y= 5 [72 vrm] = Ki KaVZn. [8.25] 


9 [n this case the potential does not drop to zero on the left side of the barrier (moreover, this is 
really a three-dimensional problem). but the essential idea. contained in Equation 8.22. is all we really 
need. 
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where : l 
2 2r 
Kies ( e Zy — 1.980 Mev!/2, [8.26] 
47r eo h 
and " 
2 E 
ps ( : AVM 1485 fm ^^. [8.27] 
47r € h 


[One fermi (fm) is 107!? m, which is about the size of a typical nucleus.) 

If we imagine the alpha particle rattling around inside the nucleus, with an 
average velocity v, the average time between “collisions” with the “wall” is about 
2r| /v, and hence the frequency of collisions is v/2r,. The probability of escape at 
each collision is e^? , so the probability of emission, per unit time, is (v/ 2r)e72”, 
and hence the lifetime of the parent nucleus is about 


e”, [8.28] 


Unfortunately, we don’t know v—but it hardly matters, for the exponential factor 
varies over a fantastic range (twenty-five orders of magnitude), as we go from 
one radioactive nucleus to another: relative to this the variation in v is pretty: 
insignificant. In particular, if you plot the logarithm of the experimentally measured 
lifetime against 1/4/E, the result is a beautiful straight line (Figure 8.6),’ just as 
you would expect from Equations 8.25 and 8.28. 


xProblem 8.3 Use Equation 8.22 to calculate the approximate transmission prob- 
ability for a particle of energy E that encounters a finite square barrier of height 
Vo > E and width 2a. Compare your answer with the exact result (Problem 2.33), 
to which it should reduce in the WKB regime 7 « 1. 


x «Problem 8.4 Calculate the lifetimes of U2*® and Po?!?, using Equations 8.25 and 
8.28. Hint: The density of nuclear matter is relatively constant (i.e., the same for 
all nuclei), so (r1)? is proportional to A (the number of neutrons plus protons). 
Empirically, 

rı = (1.07 fm)Al?, [8.29] 


From David Park. Introduction to the Quantum Theory, 3rd ed., McGraw-Hill (1992); it was 
adapted [rom I. Perlman and J. O. Rasmussen, “Alpha Radioactivity,” Encyclopedia of Physics. Vol. 42, 
Springer (1957). This material is reproduced with permission of The McGraw-Hill Companies. 
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logi9T1/2 [years] 


E [MeV] 


FIGURE 8.6: Graph of the logarithm of the lifetime versus 1/4/E (where E is the 
energy of the emitted alpha particle), for uranium and thorium. 


The energy of the emitted alpha particle can be deduced by using Einstein's formula 
(E = me): 
E= mpe? — mac — Myc. [8.30] 


where m p is the mass of the parent nucleus, mz, is the mass of the daughter nucleus, 
and mg is the mass of the alpha particle (which is to say, the He* nucleus). To 
figure out what the daughter nucleus is, note that the alpha particle carries off two 
protons and two neutrons, so Z decreases by 2 and A by 4. Look up the relevant 
nuclear masses. To estimate v, use E = (1 /2)mgv?; this ignores the (negative) 
potential energy inside the nucleus, and surely underestimates v, but it’s about the 
best we can do at this stage. Incidentally, the experimental lifetimes are 6 x 10? 
yrs and 0.5 us, respectively. 


8.3 THE CONNECTION FORMULAS 


In the discussion so far I have assumed that the “walls” of the potential well (or 
the barrier) are vertical, so that the “exterior” solution is simple, and the boundary 
conditions trivial. As it turns out, our main results (Equations 8.16 and 8.22) are 
reasonably accurate even when the edges are not so abrupt (indeed, in Gamow's 
theory they were applied to just such a case). Nevertheless, it is of some interest 
to study more closely what happens to the wave function at a turning point (E — 
V), where the "classical" region joins the "nonclassical" region, and the WKB 


326 


Chapter 8 


The WKB Approximation 


Linearized 
potential 
U(x) 


Turning 
point 


Patching 
region 


Classical 0 Nonclassical 
region region 


FIGURE 8.7: Enlarged view of the right-hand turning point. 


approximation itself breaks down. In this section I'll treat the bound state problem 

(Figure 8.1); you get to do the scattering problem for yourself (Problem 8.10). 
For simplicity, let's shift the axes over so that the right-hand turning point 

occurs at x = 0 (Figure 8.7). In the WKB approximation, we have | 


Beh ^ ! pc) dx' pge k a ifx <0 
oui ’ . , 


W(x) = [8.31] 


A Ipod da if x » 0. 
Tm 


(Assuming V (x) remains greater than E for all x > 0, we can exclude the positive 
exponent in this region, because it blows up as x — oo.) Our task is to join the 
two solutions at the boundary. But there is a serious difficulty here: In the WKB 
approximation, w goes to infinity at the turning point (where p(x) — 0). The true 
wave function, of course, has no such wild behavior—as anticipated, the WKB 
method simply fails in the vicinity of a turning point. And yet, it is precisely the 
boundary conditions at the turning points that determine the allowed energies. What 
we need to do, then, is splice the two WKB solutions together, using a “patching” 
wave function that straddles the turning point. 

Since we only need the patching wave function (Wp) in the neighborhood of 
the origin, we'll approximate the potential by a straight line: 


V(x) € E+V'(0)x. [8.32] 


8 Warning: The following argument is quite technical, and you may wish to skip it on a first 
reading. 
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and solve the Schródinger for this linearized V: 


n? d^y, 
—— E + V'(O)x]v,, = Ev,. 
2m dx? XE AE, Vp. 
Or F 
d^yp 
uà OU 
where iA 
2m $ 


Z= Qx 
so that : 
dp | 
2:3 704A 
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[8.33] 


[8.34] 


[8.35] 


[8.36] 


This is Airy's equation, and the solutions are called Airy functions.? Since the 
Airy equation is a second-order differential equation, there are two linearly inde- 


pendent Airy functions, Ai (z) and Bi(z). 


TABLE 8.1: Some properties of the Airy functions. 


*Boxan hans d 
Differential Equation : 572 
Solutions: Linear combinations of Airy Functions, Aí(z) and Bi(z). 


a 3 
Integral Representation: — Ai(z) — ij cos (S T s:) ds, 
0 


3 
U eae s? 
Bi(z)= — les ME Te + 3 
Tjo 3 
Asymptotic Forms: 


: l xA 7 l . [2 ABS 
Ai(z)-——-—4€3- Ai(z) ~ aeies -+ d 
54m 1H MA 3 4 
2xRz z0: vn(-z) 
MIR l ile "EN l E3 BESSER 
Bi(z) Je ea Bi(z) Waco cos x) z] 


z0. 


Q . . . -. n 

? Classically. a linear potential means a constant force, and hence a constant acceleration — the 
simplest nontrivial motion possible. and the starting point for elementary mechanics. It is ironic that 
the same potential in quantum mechanics gives rise to unfamiliar transcendental functions. and plays 


only a peripheral role in the theory. 


328 


Chapter 8 The WKB Approximation 


FIGURE 8.8: Graph of the Airy functions. 


They are related to Bessel functions of order 1/3; some of their properties are 
listed in Table 8.1 and they are plotted in Figure 8.8. Evidently the patching wave 
function is a linear combination of Ai(z) and Bi(z): 


Vp(x) = aAi(ax) + bBi(ax). [8.37] 


for appropriate constants a and b. 

Now Wp is the (approximate) wave function in the neighborhood of the origin: 
our job is to match it to the WKB solutions in the overlap regions on either side 
(see Figure 8.9). These overlap zones are close enough to the turning point that the 
linearized potential is reasonably accurate (so that w, is a good approximation to 
the true wave function), and yet far enough away from the turning point that the 
WKB approximation is reliable.!? In the overlap regions Equation 8.32 holds, and 
therefore (in the notation of Equation 8.34) 


p(x) € J2m(E — E — V'(0)x) = hat? Sox. [8.38] 


In particular, in overlap region 2, 


X x , 
i Ip) dx! = hai? | Vx dx! = zh), 
0 0 3 


lOThis is a delicate double constraint, and it is possible to concoct potentials so pathological that 
no such overlap region exists. However, in practical applications this seldom occurs. Sec Problem 8.8. 
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Overlap 1 Turning ^ Overlap 2 ^ 
point 
Patching 
region 

FIGURE 8.9: Patching region and the two overlap zones. 
and therefore the WKB wave function (Equation 8.31) can be written as 

D S uda 

w(x) = ——L À e 0n. [8.39] 


Jhos/4x1/4 


Meanwhile, using the large-z asymptotic forms ^ of the Airy functions (from 
Table 8.1), the patching wave function (Equation 8.37) in overlap region 2 becomes 


11 


a EPERE TA) b RE 
E ,7 3(oaxy" 3 (ax) , 8.40 
voe) 2/2 (ax)1/A* s Jirar) A" a 
Comparing the two solutions, we see that 
4 
a= ZD, and b=0. [8.41] 
1 


Now we go back and repeat the procedure for overlap region 1. Once again, 
p(x) is given by Equation 8.38, but this time x is negative, so 


0 2 : 
J p^) dx! & hax? [8.42] 


and the WKB wave function (Equation 8.31) is 


l 


w(x) = Ra) 


Ln + MM l [8.43] 


H Ar first glance it seems absurd to use a /arge-z approximation in this region, which after all is 
supposed to be reasonably close to the turning point at z = 0 (so that the linear approximation to the 
potential is valid). But notice that the argument here is ax. and if you study the matter carefully (see 
Problem 8.8) you will find that there is (typically) a region in which ox is large, but at the same time 
it is reasonable to approximate V (x) by a straight line. 
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Meanwhile, using the asymptotic form of the Airy function for large negative z 
(Table 8.1), the patching function (Equation 8.37. with b = 0) reads 


Vp) = sin S (Ca + - 
ea Ji (-ax)/A 4 


a , 22,2 372 iA aizilLhm,pwu3 
EE ud [inh LIE UL E |: [8.44] 
Jm(—ax)!/4 2i 


Comparing the WKB and patching wave functions in overlap region 1, we find 


a irt T ELLE C ; 
2i [T Vha 2i sn ħa 
or, putting in Equation 8.41 for a: 
B = —ie"" p, and C =ie!*/4D. [8.45] 


These are the so-called connection formulas, joining the WKB solutions at either 
side of the turning point. We're done with the patching wave function now— its 
only purpose was to bridge the gap. Expressing everything in terms of the one 
normalization constant D, and shifting the turning point back from the origin to an 
arbitrary point x2, the WKB wave function (Equation 8.31) becomes 


2D sin| = pe ds + 7 | if x <x 
EZ x X JAX =|. / X2; 
JPG) End, 4 å 
VQ) Hi a [8.46] 


e|; f. ipa. if x x. 
JG 


Example 8.3 Potential well with one vertical wall. Imagine a potential well 
that has one vertical side (at x = 0) and one sloping side (Figure 8.10). In this 
case y (0) = 0, so Equation 8.46 says 


L Í ) ( 122 73 A. 
Or 


[8.47] 


X23 1 
f p(x)dx = (» — z) nh. 
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FIGURE 8.10: Potential well with one vertical wall. 


For instance, consider the “half-harmonic oscillator,” 
if x > 0. [8.48] 


otherwise. 


NS 
—mo x. 


2 
0. 


In this case 
p(x) = yJ2m[E — (1/2)mo?x?] = moy: A 
where 
] /2E 
x2 = — | — 
w m 
tE 


is the turning point. So 
X2 X23 = T 5 
| p(x) dx = mo | x5 — x? dx = —mox3 = —. 
0 0 4 = 2w 


and the quantization condition (Equation 8.47) yields 
] 3 7 dH 
En = (m — z) ho = ( — ) ho. 


2 ESSA 
In this particular case the WKB approximation actually delivers the exact allowed 
energies (which are precisely the odd energies of the ful] harmonic oscillator— see 


V(x) = | 


[8.49] 


Problem 2.42). 
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Example 8.4 Potential well with no vertical walls. Equation 8.46 connects the 
WKB wave functions at a turning point where the potential slopes upward 
(Figure 8.11(a)); the same reasoning, applied to a downward-sloping turning point 
(Figure 8.11(b)), yields (Problem 8.9) 


D' | ji [ ; ; 
—— exp | -— |lp(x )|dx |. ifx <x: 
Pay L Ads 


(x)= í 8.50 
y(x) 2D! T i TNR d — [ ] 
FIC; S1 h A pix )ax AV ` Xie 


In particular, if we're talking about a potential well (Figure 8.11(c)), the wave 
function in the “interior” region (xj < x < x2) can be written either as 


i= cire. Wea T (x) dx! +2 
- = Xe LJ X = TT X X —— 
V P(x) x : h Jx P 4 


(Equation 8.46), or as 


/ 


EI p(x) 


" : l $ / / TT 
sin ı (x). where ð (x) = -S p(x )dx — T 


Xi 


(Equation 8.50). Evidently the arguments of the sine functions must be equal, mod- 
ulo z:!? 6) = 6; + nz, from which it follows that 


X5 l 
p(x)dx = ¢ — :) mh. withn=1,2,3,.... [8.51] 
V(x) 
| 
E ( E E 
Xo X X X» 
X 
(a) (b) (c) 


FIGURE 8.11: Upward-sloping and downward-sloping turning points. 


I? Not 22 —an overall minus sign can be absorbed into the normalization factors D and D'. 
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This quantization condition determines the allowed energies for the “typical” 
case of a potential well with two sloping sides. Notice that it differs from the 
formulas for two vertical walls (Equation 8.16) or one vertical wall (Equation 8.47) 
only in the number that is subtracted from n (0, 1/4, or 1/2). Since the WKB 
approximation works best in the semi-classical (large 1) regime, the distinction is 
more in appearance than in substance. In any event, the result is extraordinarily 
powerful, for it enables us to calculate (approximate) allowed energies without ever 
solving the Schródinger equation, by simply evaluating one integral. The wave 
function itself has dropped out of sight. 


x xProblem 8.5 Consider the quantum mechanical analog to the classical problem of 
a ball (mass m) bouncing elastically on the floor. !? 


(a) What is the potential energy, as a function of height x above the floor? (For 
negative x, the potential is infinite —the ball can't get there at all.) 


(b) Solve the Schródinger equation for this potential, expressing your answer in 
terms of the appropriate Airy function (note that Bi (z) blows up for large z, 
and must therefore be rejected). Don't bother to normalize w(x). 


(c) Using g = 9.80 m/s? and m = 0.100 kg, find the first four allowed energies, 
in joules, correct to three significant digits. Hint: See Milton Abramowitz and 
Irene A. Stegun, Handbook of Mathematical Functions, Dover, New York 
(1970), page 478; the notation is defined on page 450. 


(d) What is the ground state energy, in eV, of an electron in this gravitational 
field? How high off the ground is this electron, on the average? Hint: Use 
the virial theorem to determine (x). 


xProblem 8.6 Analyze the bouncing ball (Problem 8.5) using the WKB approxi- 
mation. 


(a) Find the allowed energies, E,, in terms of m, g, and ħ. 


(b) Now put in the particular values given in Problem 8.5(c). and compare the 
WKB approximation to the first four energies with the "exact" results. 


(c) About how large would the quantum number 7 have to be to give the ball 
an average height of, say, | meter above the ground? 


'3For more on the quantum bouncing ball see J. Gea-Banacloche, Am. J. Phys. 67, 776 (1999) and ne 
N. Wheeler. “Classical/quantum dynamics in a uniform gravitational field." unpublished Reed College 
report (2002). This may sound like an awfully artificial problem, but the experiment has actually been H 
done. using neutrons (V. V. Nesvizhevsky et al., Nature 415, 297 (2002)). 
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xProblem 8.7 Use the WKB approximation to find the allowed energies of the 
harmonic oscillator. 


Problem 8.8 Consider a particle of mass m in the nth stationary state of the 
harmonic oscillator (angular frequency w). 


(a) Find the turning point, x2. 


(b) How far (d) could you go above the turning point before the error in the 
linearized potential (Equation 8.32, but with the turning point at x?) reaches 
1%? That is, if 


V (x2 +d) — Vin(x2 + d) 


— 0.01, 
V (x2) 


what is d? 


(c) The asymptotic form of Ai(z) is accurate to 1% as long as z > 5. For the d 
in part (b), determine the smallest n such that ad > 5. (For any n larger than 
this there exists an overlap region in which the linearized potential is good 
to 1% and the large-z form of the Airy function is good to 1%.) 


x xProblem 8.9 Derive the connection formulas at a downward-sloping turning 
point, and confirm Equation 8.50. 


x x xProblem 8.10 Use appropriate connection formulas to analyze the problem of 
scattering from a barrier with sloping walls (Figure 8.12). Hint: Begin by writing 
the WKB wave function in the form 


] 
Vv P(x) 


EP Spe dy! im sd af 
| Ae! Jy! poc) dx + Be b J.) pix) dx | (x <x): 


toys x Ly! 2A i » x! Ly! 
(x)= [cet ho | p(x’)| dx 4+ De ^ de, PO da | (Qu <x <4): 


1 
V Ip (x)| 


|| i ft ae) 
——— | Fe?" ‘ (X > x2). 8.52 
al : a 


Do not assume C = 0. Calculate the tunneling probability, T = |F|?/|Al?. 
and show that your result reduces to Equation 8.22 in the case of a broad, high 
barrier. 
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FIGURE 8.12: Barrier with sloping walls. 


FURTHER PROBLEMS FOR CHAPTER 8 


x Problem 8.11 Use the WKB approximation to find the allowed energies of the 
general power-law potential: 


V (x) 2 a|x|". 


where v is a positive number. Check your result for the case v = 2. Answer: !4 


g r(3+4) (35) 
Esca G1 2h = m [8.53] 
2ma r (: ) 


"L2 
V 


x xProblem 8.12 Use the WKB approximation to find the bound state energy for the 


potential in Problem 2.51. Compare the exact answer. Answer: —[(9/8) — (1/4/2)] 
h^a? [m. 


Problem 8.13 For spherically symmetrical potentials we can apply the WKB 
approximation to the radial part (Equation 4.37). In the case/ = 0 it is reasonable"? 
to use Equation 8.47 in the form 


| i p(r)dr = (n —1/4)xh. [8.54] 
0 


H As always. the WKB result is most accurate in the semi-classical (large n) regime. In particular, 
Equation 8.53 is not very good for the ground state (n = 1). See W. N. Mei. Am. J. Phys. 66, 541 (1998). 
I^ Application of the WKB approximation to the radial equation raises some delicate and subtle 


problems. which I will not go into here. The classic paper on the subject is R. Langer. Phys. Rev. 51, 
669 (1937). 
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where rg is the turning point (in effect, we treat r = O as an infinite wall). 
Exploit this formula to estimate the allowed energies of a particle in the logarithmic 
potential 


V(r) = Vo ln(r/a) 


(for constants Vo and a). Treat only the case / = 0. Show that the spacing between 
the levels is independent of mass. Partial answer: 


n HA) 


Ej4| — En = Voln (c 


x xProblem 8.14 Use the WKB approximation in the form 


f | p(r)dr = (n — 1/2)zh [8.55] 


to estimate the bound state energies for hydrogen. Don't forget the centrifugal term 
in the effective potential (Equation 4.38). The following integral may help: 


b 
- (x —a)(b —x)dx = zb — Jay, [8.56] 


Note that you recover the Bohr levels when n >> / and n > 1/2. Answer: 


—13.6 eV 


En S ___ > FT 8.57 
"= In - 0/2 + Ji + DP [8.57] 


x x «Problem 8.15 Consider the case of a symmetrical double well, such as the one 
pictured in Figure 8.13. We are interested in bound states with E < V(0). 


(a) Write down the WKB wave functions in regions (i) x > x2, (ii) x < x < x», 
and (iii) 0 < x < xı. Impose the appropriate connection formulas at x; and 
X2 (this has already been done, in Equation 8.46, for x»; you will have to 
work out x; for yourself). to show that 


D E tren 
FIC exp E f. Ip) dx | (i) 
? X 
y(x) = NES sin Hi po») dx! + 4 (ii) 


l2 COS eli JE lp ax + sin de hc EE (iii) 


D 
A IpGx)] 


Further Problems for Chapter 8 337 


FIGURE 8.13: Symmetrical double well; Problem 8.15. 


where 


h 


(b) Because V(x) is symmetric, we need only consider even (+) and odd (—) 
wave functions. In the former case w’(0) = 0, and in the latter case (0) = 0. 
Show that this leads to the following quantization condition: 


] X2 
0 = — | p(x) dx. [8.58] 
XI 


tang = 26. [8.59] 


where E 
ó- Ji Ipa] dx’. [8.60] 
h =N] 


Equation 8.59 determines the (approximate) allowed energies (note that E 
comes into x; and x2, so 0 and ¢ are both functions of E). 


(c) We are particularly interested in a high and/or broad central barrier, in which 
case @ is large, and e? is huge. Equation 8.59 then tells us that @ must be 
very close to a half-integer multiple of x. With this in mind, write 0 = 
(n + 1/2)z +e. where |e| « 1, and show that the quantization condition 
becomes 


l ] 
~ = ae 
p= (n+5) nz 50 ; [8.61] 


(d) Suppose each well is a parabola: !6 


] 
-mw (x ta), ifx <0. 
V(x) = [8.62] 


jme — a)*, if x > 0. 


16Even if V(x) is not strictly parabolic in each well. this calculation of 6. and hence the result 
(Equation 8.63) will be approximately correct. in the sense discussed at the beginning of Section 2.3, 
with w = J/V"(xg)/m. where xp is the position of the minimum. 
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— 
oO 
~ 


— 
Lm 
~e 


Sketch this potential, find 9 (Equation 8.58), and show that 


h 
p = (x " 5) hor <et, [8.63] 
2 2 


Comment: If the central barrier were impenetrable ($ — oo), we would 
simply have two detached harmonic oscillators, and the energies, E, = (n + 
1/2)hiw, would be doubly degenerate, since the particle could be in the left 
well or in the right one. When the barrier becomes finite (putting the two wells 
into communication"), the degeneracy is lifted. The even states (V) have 
slightly lower energy. and the odd ones (y; ) have slightly higher energy. 


Suppose the particle starts out in the right well—or, more precisely, in a 
state of the form 


Edi ir 
V(x.0) = Fava TY 


which, assuming the phases are picked in the "natural" way, will be concen- 
trated in the right well. Show that it oscillates back and forth between the 


wells, with a period 


22? 
e?. [8.64] 


T = 
wW 


Calculate ¢, for the specific potential in part (d), and show that for V (0) > E, 
$ ~ mwa [h. 


Problem 8.16 Tunneling in the Stark effect. When you turn on an external elec- 
tric field, the electron in an atom can, in principle, tunnel out, ionizing the atom. 
Question: Is this likely to happen in a typical Stark effect experiment? We can 
estimate the probability using a crude one-dimensional model, as follows. Imagine 
a particle in a very deep finite square well (Section 2.6). 


(a) 


(b) 


What is the energy of the ground state, measured up from the bottom of the 
well? Assume Vo >> fi?/ma*. Hint: This is just the ground state energy of 
the infinite square well (of width 2a). 


Now introduce a perturbation H' = —ox (for an electron in an electric 
field E = —E,,,f we would have a = eE,;44). Assume it is relatively weak 
(aa « h?/ma?). Sketch the total potential, and note that the particle can 
now tunnel out, in the direction of positive x. 


Calculate the tunneling factor y (Equation 8.22), and estimate the time 
it would take for the particle to escape (Equation 8.28). Answer: y — 


/ 8m V3 /3ah, t = (8ma?/n l)e. 
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(d) Put in some reasonable numbers: Vo = 20 eV (typical binding energy for an 
outer electron), a = 107!0 m (typical atomic radius), Ec = 7 x 106 V/m 
(strong laboratory field), e and m the charge and mass of the electron. Cal- 
culate t, and compare it to the age of the universe. 


Problem 8.17 About how long would it take for a can of beer at room temperature 
to topple over spontaneously, as a result of quantum tunneling? Hint: Treat it as a 
uniform cylinder of mass m, radius R, and length h. As the can tips, let x be the 
height of the center above its equilibrium position (4/2). The potential energy is 
ingx, and it topples when x reaches the critical value xo = y R? + (h/2)? — h/2. 
Calculate the tunneling probability (Equation 8.22), for E — 0. Use Equation 8.28, 
with the thermal energy ((1 /2)mv* = (1/2)kgT) to estimate the velocity. Put in 
reasonable numbers, and give your final answer in years.!7 


ITR, E. Crandall, Scientific American, February 1997, p. 74. 


CHAPTER 9 


TIME-DEPENDENT 
PERTURBATION THEORY 


Up to this point, practically everything we have done belongs to the subject that 
might properly be called quantum statics, in which the potential energy function is 
independent of time: V (x.t) = V(r). In that case the (time-dependent) Schrödinger 
equation, 


OW 
AW —ih—. 
at 


can be solved by separation of variables: 
Yr. f) = y(r)e EN, 
where y (r) satisfies the time-independent Schrödinger equation, 


Hy = Ey. 


Because the time dependence of separable solutions is carried by the exponential 
factor (e~!£'/"), which cancels out when we construct the physically relevant quan- 
tity |W|?, all probabilities and expectation values are constant in time. By forming 
linear combinations of these stationary states we obtain wave functions with more 
interesting time dependence, but even then the possible values of the energy, and 
their respective probabilities, are constant. 

If we want to allow for transitions (quantum jumps, as they are sometimes 
called) between one energy level and another, we must introduce a time-dependent 
potential (quantum dynamics). There are precious few exactly solvable problems 
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in quantum dynamics. However, if the time-dependent portion of the Hamiltonian 
is small compared to the time-independent part, it can be treated as a perturbation. 
My purpose in this chapter is to develop time-dependent perturbation theory, and 
study its most important application: the emission or absorption of radiation by 
an atom. 


9.1 TWO-LEVEL SYSTEMS 


To begin with. let us suppose that there are just /wo states of the (unperturbed) 
system, Wy and Yp. They are eigenstates of the unperturbed Hamiltonian, H9: 


H? Ya = Ea Wa. and Hwy = Egy. [9.1] 


and they are orthonormal: 
(WalWb) = ĉab. [9.2] 


Any state can be expressed as a linear combination of them; in particular, 
W(0) = cava + Cb Vb. [9.3] 


The states Ya and yp might be position-space wave functions, or spinors, or 
something more exotic—it doesn't matter; it is the zime dependence that concerns 
us here, so when J write Y(t), I simply mean the state of the system at time f. 
In the absence of any perturbation, each component evolves with its characteristic 
exponential factor: 


V (f) = CaWge Ph + cy Vrpe i EAR, [9.4] 


We say that |c,|* is the “probability that the particle is in state Ya” —by which 
we really mean the probability that a measurement of the energy would yield the 
value E,,. Normalization of V requires, of course, that 


2 
[= 


lcal? Jen? = 1. [9.5] 


9.1.1 The Perturbed System 


Now suppose we turn on a time-dependent perturbation. H'(t). Since Ya and vij, 
constitute a complete set, the wave function WV (f) can still be expressed as a linear 
combination of them. The only difference is that c; and c; are now functions of t: 


W(t) = calpa I P^ + ey (ry e! PHP [9.6] 


(I could absorb the exponential factors into c,(f) and cp(t), and some people 
prefer to do it this way, but I think it is nicer to keep visible that part of the 
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time dependence that would be present even without the perturbation.) The whole 
problem is to determine c; and cp, as functions of time. If, for example, the particle 
started out in the state Wy (c, (0) = 1. c5 (0) = 0), and at some later time ¢,; we find 
that c4 (fi) = 0, c5 (fi) = 1, we shall report that the system underwent a transition 
from Ya to Wp. 

We solve for c, (f) and cy (t) by demanding that V (f) satisfy the time-dependent 
Schródinger equation, 


aw 
HV = ii. where H = H? + H'(t). [9.7] 


From Equations 9.6 and 9.7, we find: 


Cal H pale E + es H9 yr je El + e [H' Yale P7. + es Hy je E 


= in| és ue "Eat + ép Vrpe i ENI’ 


iE ; iE 
t CaWa (- m) g thun + cpp (-) gris. 


In view of Equation 9.1, the first two terms on the left cancel the last two terms 
on the right, and hence 


cal H Wale 91 es H'et E mih [i ye ése EA]. [9.8] 


To isolate ¢,, we use the standard trick: Take the inner product with y,, and 
exploit the orthogonality of Ya and Yp (Equation 9.2): 


ca (Va | H' |a )e E + e (al H lope P^ = ine et Bath, 


For short. we define 
H;; = (ilH vg): [9.9] 


note that the hermiticity of H’ entails Hj; = (H;;)*. Multiplying through by 
—(i/h)e!®u'/", we conclude that: 


i LIE, — 
Ca = h E Haa tb Haye ix ROI * [9.10] 
Similarly, the inner product with y; picks out ċp: 


Ca (Vp | H' Wale 1 PP + en Gyu E H"|yp)e Eh = ihèpe i E, 


and hence . 
E l TP E 
Cp — ^h [co Hp, + Ca Hie (En eou 4 [9.1 1] 
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Equations 9.10 and 9.11 determine c,(t) and c,(t); taken together, they are 
completely equivalent to the (time-dependent) Schrédinger equation, for a two-level 
system. Typically, the diagonal matrix elements of H’ vanish (see Problem 9.4 for 
the general case): 

H’ 


aa 


= Hj, =0. [9.12] 


If so, the equations simplify: 


[9.13] 


where 


E,- E 
wy = ae [9.14] 


(T 1l assume that Ej > Ea, so wo > 0.) 


xProblem 9.1 A hydrogen atom is placed in a (time-dependent) electric field E — 
E (jk. Calculate all four matrix elements H; j of the perturbation H’ = eEz between 
the ground state (n = 1) and the (quadruply degenerate) first excited states (n = 2). 
Also show that H;, = 0 for all five states. Note: There is only one integral to be 
done here, if you exploit oddness with respect to z; only one of the n = 2 states 
is "accessible" from the ground state by a perturbation of this form, and therefore 


the system functions as a two-state configuration—assuming transitions to higher 
excited states can be ignored. 


xProblem 9.2 Solve Equation 9.13 for the case of a time-independent perturbation, 
assuming that c,(0) = 1 and c,(0) = 0. Check that |eq(t)|? + Jes (0)? = 1. 
Comment: Ostensibly, this system oscillates between "pure V," and “some Wy.” 
Doesn’t this contradict my general assertion that no transitions occur for time- 
independent perturbations? No, but the reason is rather subtle: In this case yw, and 
V, are not, and never were, eigenstates of the Hamiltonian—a measurement of the 
energy never yields E, or Ep. In time-dependent perturbation theory we typically 
contemplate turning on the perturbation for a while, and then turning it off again, 
in order to examine the system. At the beginning, and at the end, Ya and Wy are 
eigenstates of the exact Hamiltonian, and only in this context does it make sense 
to say that the system underwent a transition from one to the other. For the present 
problem, then, assume that the perturbation was turned on at time ¢ = 0. and 
off again at time —this doesn't affect the calculations, but it allows for a more 
sensible interpretation of the result. 


x xProblem 9.3 Suppose the perturbation takes the form of a delta function (in time): 


H' = US(t): 
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assume that Usa = Upp = 0, and let Uap = Up, = a. If ca(—oo) = 1 and 
c,(—00) = 0, find ca(t) and cp(t), and check that |c.(t)|* + |ep(t)|? = 1. What is 
the net probability (P,-,;, for t — oo) that a transition occurs? Hint: You might 
want to treat the delta function as the limit of a sequence of rectangles. Answer: 
P, = sin? (Jo|/h). 


9.1.2 Time-Dependent Perturbation Theory 


So far, everything is exact: We have made no assumption about the size of the per- 
turbation. But if H’ is "small," we can solve Equation 9.13 by a process of succes- 
sive approximations, as follows. Suppose the particle starts out in the lower state: 


cu, (0) 21. cp(0) = 0. [9.15] 
If there were no perturbation at all, they would stay this way forever: 


Zeroth Order: 
c —1, co. [9.16] 


(I'll use a superscript in parentheses to indicate the order of the approximation.) 
To calculate the first-order approximation, we insert the zeroth-order values 
on the right side of Equation 9.13: 


First Order: 
1) 
dcl 
7 = > Pn — |: 
c 
jes! [9.17] 
ac 1 1) I / ^s dct" } 
= ——H, gi"?! — el e" H, (t )e ^" dt’. 
dt jp d ħ Jo ™ 
Now we insert these expressions on the right, to obtain the second-order 
approximation: 
Second Order: 


dct? i è i ! . , 
: / — ; / / 
f = Th Hape month (-;) i Hy, (t je! "n! dt => 


1 t t 
cC? (t) =I- zl Hip (t ^jg ier | 
1 JO 0 


while cp is unchanged (P(t) = = ci. (Notice that ce 2) (t) includes the zeroth- 
order term; the second-order correction would be the integral part alone.) 


[9.18] 


d 


Hja (ei a” dt’. 
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In principle, we could continue this ritual indefinitely, always inserting the 
nth-order approximation into the right side of Equation 9.13, and solving for the 
(n + 1)th order. The zeroth order contains no factors of H’, the first-order cor- 
rection contains one factor of H’, the second-order correction has two factors 
of H', and so on.! The error in the first-order approximation is evident in the 


fact that ie ng + let qo? Æ 1 (the exact coefficients must, of course, obey 


Equation 9.5). However, MOI + leg? (OP is equal to | to first order in H’, 
which is all we can expect from a first-order approximation. And the same goes 


for the higher orders. 


x «Problem 9.4 Suppose you don't assume Hia = Hp, = 0. 


(a) Find c4(f) and cp(t) in first-order perturbation theory, for the case cq(0) = 
1, cp(0) = 0. Show that [cf (t)? + [ef [2 = 1, to first order in H’. 


H 


(b) There is a nicer way to handle this problem. Let 


d, = eh fo HM e dp = ef fo His Dd o, [9.19] 
Show that 
d = i ib py! tet | : d = i -i$ py! ient q 9.20 
a= Es abe Gp: b= h bha as [9.20] 
where y: 
$(t)z 7) [H,, (r^) — Appt’) dt’. [9.21] 


So the equations for d; and dp are identical in structure to Equation 9.13 
(with an extra factor e/? tacked onto H^). 


(c) Use the method in part (b) to obtain c, (ft) and cp (t) in first-order perturbation 
theory, and compare your answer to (a). Comment on any discrepancies. 


xProblem 9.5 Solve Equation 9.13 to second order in perturbation theory, for the 
general case ca (0) = a. c, (0) = b. 


x xProblem 9.6 Calculate c;(r) and cp(t), to second order, for a time-independent 
perturbation (Problem 9.2). Compare your answer with the exact result. 


l Notice that c, is modified in every even order. and cp in every odd order: this would not be 
true if the perturbation included diagonal terms, or if the systein started out in a linear combination of 
the two states. 
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9.1.3 Sinusoidal Perturbations 


Suppose the perturbation has sinusoidal time dependence: 


H'(r. t) = V(r) cos(ot). [9.22] 

so that 
H,, = Vab cos(ot), [9.23] 

where 
Vab = (WalV|wWo). [9.24] 


(As before, Pl] assume the diagonal matrix elements vanish, since this is almost 
always the case in practice.) To first order (from now on we’ll work exclusively in 
first order, and I'll dispense with the superscripts) we have (Equation 9.17): 


i ! . " i Vi j y " n , 
colt) >~ -Via f cos(wt el! dt FN ba | Du d goce dt’ 
eee 2h: Jo 
V gott _ | el -9 _ | 
ei, [9.25] 
2h wo +w Wo — w 


That's the answer, but it’s a little cumbersome to work with. Things sinplify 
substantially if we restrict our attention to driving frequencies (w) that are very 
close to the transition frequency (wo), so that the second term in the square brackets 
dominates; specifically, we assume: 


w +w > |w — ol. [9.26] 


This is not much of a limitation, since perturbations at other frequencies have 
a negligible probability of causing a transition anyway.? Dropping the first term, 


we have 
i(wp—ev)t /2 
c(t) c Vba gom eae " ge 
2h og—o 
Vba sin —w)t ; 
— -i »- = /2] e?» /2. [9.27] 


The transition probability —the probability that a particle which started out in 
the state w, will be found, at time f, in the state V; —is 


| Vab]? sin?[(wo — 1/2] 


Pa->p(t) = lep (t)? ES [9.28] 


h? (wo — w)? 


?In the following sections we will be applying this theory to the case of light, for which w ~ 10!5 


=I so the denominator in both terms is huge. except (for the second one) in the neighborhood of wp. 


S 
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FIGURE 9.1: Transition probability as a function of time, for a sinusoidal perturba- 
tion (Equation 9.28). 


The most remarkable feature of this result is that, as a function of time, the 
transition probability oscillates sinusoidally (Figure 9.1). After rising to a maxi- 
mum of |Van |? /h? (cog — w)? — necessarily much less than 1. else the assumption 
that the perturbation is "small" would be invalid—it drops back down to zero! 
At times t, = 2nz/|oo — «|, where n = 1.2.3...., the particle is certain to 
be back in the lower state. If you want to maximize your chances of provoking 
a transition, you should rot keep the perturbation on for a long period; you do 
better to turn it off after a time 2/|w9 — «|, and hope to “catch” the system in 
the upper state. In Problem 9.7 it is shown that this "flopping" is not an artifact 
of perturbation theory —it occurs also in the exact solution, though the flopping 
frequency is modified somewhat. 

As I noted earlier, the probability of a transition is greatest when the driving 
frequency is close to the "natural" frequency, wo. This is illustrated in Figure 9.2, 
where P,» is plotted as a function of c. The peak has a height of (| Vaplt /25)? and 
a width 47 /t; evidently it gets higher and narrower as time goes on. (Ostensibly, 
the maximum increases without limit. However, the perturbation assumption breaks 


P(o) 


Wo o 
(© — 2n/t) (@ + 2n/t) 


FIGURE 9.2: Transition probability as a function of driving frequency (Equation 9.28). 
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down before it gets close to 1, so we can believe the result only for relatively small 
t. In Problem 9.7 you will see that the exact result never exceeds 1.) 


x «Problem 9.7 The first term in Equation 9.25 comes from the e/^'/2 part of cos(«t), 
and the second from e Thus dropping the first term is formally equivalent 
to writing H’ = (V/2)e^'*', which is to say, 


Y Vab ; 
LN 0 — —itot po 4P fowt 
Hga =e Hime. [9.29] 


£ 
(The latter is required to make the Hamiltonian matrix hermitian—or, if you pre- 
fer, to pick out the dominant term in the formula analogous to Equation 9.25 
for ca(t).) Rabi noticed that if you make this so-called rotating wave approxi- 
mation at the beginning of the calculation, Equation 9.13 can be solved exactly, 
with no need for perturbation theory, and no assumption about the strength of 
the field. 


(a) Solve Equation 9.13 in the rotating wave approximation (Equation 9.29), for 
the usual initial conditions: c; (0) = 1. c,(0) = 0. Express your results (c, (t) 
and cp (t)) in terms of the Rabi flopping frequency, 


| / 
wr = 2 (o — wo)? + (|Vao|/h)?. [9.30] 


(b) Determine the transition probability, P;.., (1), and show that it never exceeds 
1. Confirm that |cq(t)|? + Jes (0)? = 1. 


(c) Check that Py_.,(t) reduces to the perturbation theory result (Equation 9.28) 
when the perturbation is "small," and state precisely what small means in 
this context, as a constraint on V. 


(d) At what time does the system first return to its initial state? 


9.2 EMISSION AND ABSORPTION OF RADIATION 


9.2.1 Electromagnetic Waves 


An electromagnetic wave (I'll refer to it as "light," though it could be infrared, 
ultraviolet, microwave, X-ray, etc.; these differ only in their frequencies) consists 
of transverse (and mutually perpendicular) oscillating electric and magnetic fields 
(Figure 9.3). An atom, in the presence of a passing light wave, responds primarily 
to the electric component. If the wavelength is long (compared to the size of the 


Section 9.2: Emission and Absorption of Radiation 349 


Electric field 


Direction of 
propagation 


Magnetic field 


FIGURE 9.3: An electromagnetic wave. 


atom), we can ignore the spatial variation in the field;* the atom, then, is exposed 
to a sinusoidally oscillating electric field 


E = Eo cos(wt) k [9.31] 


(for the moment I’ll assume the light is monochromatic, and polarized along the z 
direction). The perturbing Hamiltonian is* 


H' = —qEoz cos(ot). [9.32] 
where q is the charge of the electron? Evidently® 
Hy, = —fEocos(ot). where p = q (Yp\z|WYa). [9.33] 


Typically, y is an even or odd function of z; in either case z|V|^ is odd, and 
integrates to zero (see Problem 9.1 for some examples). This licenses our usual 
assumption that the diagonal matrix elements of H’ vanish. Thus the interaction of 


3For visible light A ~ 5000À. while the diameter of an atom is around 1 Å. so this approximation 
is reasonable: but it would not be for X-rays. Problem 9.21 explores the effect of spatial variation of 
the field. 


+The energy of a charge q in a static field E is —q f E- dr. You may well object to the use of 
an electrostatic formula for a manifestly time-dependent field. I am implicitly assuming that the period 
of oscillation is long compared to the time it takes the charge to move around (within the atom). 


* As usual, we assume the nucleus is heavy and stationary: it is the wave lunction of the electron 
that concerns us. 


The letter p is supposed to remind you of electric dipole moment (for which. in electrodynam- 
ics, the letter p is customarily used—in this context it is rendered as a squiggly p to avoid confusion 
with momentum). Actually. œ is the off-diagonal matrix element of the z component of the dipole 
moment operator. qr. Because of its association with electric dipole moments. radiation governed by 
Equation 9.33 is called electric dipole radiation; it is overwhelmingly the dominant kind, at least in 
the visible region. See Problem 9.21 for gencralizations and terminology. 
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light with matter is governed by precisely the kind of oscillatory perturbation we 
studied in Section 9.1.3, with 


Vba = —g Eo. [9.34] 


9.2.2 Absorption, Stimulated Emission, and Spontaneous Emission 


If an atom starts out in the "lower" state Ya, and you shine a polarized monochro- 
matic beam of light on it, the probability of a transition to the "upper" state yp is 
given by Equation 9.28, which (in view of Equation 9.34) takes the form 


TR ee 
f) sin^[(co w)t/2) [9.35] 


Pa sp (t) = ( h 


(wo — w)? 


In this process, the atom absorbs energy Ep — Ea = ħwo from the electromagnetic 
field. We say that it has "absorbed a photon" (Figure 9.4(a)). (As I mentioned ear- 
lier, the word "photon" really belongs to quantum electrodynamics [the quantum 
theory of the electromagnetic field], whereas we are treating the field itself classi- 
cally. But this language is convenient, as long as you don't read more into it than 
is really there.) 

I could, of course, go back and run the whole derivation for a system that starts 
off in the upper state (c,(0) = 0, c5 (0) = 1). Do it for yourself, if you like; it comes 
out exactly the same—except that this time we're calculating Ppa = |ca(t)|?, the 
probability of a transition down to the lower level: 

Ie|Eo * sint 1/2] 
P| J sin” [(wo — w)t/ [9.36] 


VEHI ( h (wo — w)? 


(It has to come out this way —all we're doing is switching a <> b, which substitutes. 
—wo for «o. When we get to Equation 9.25 we now keep the first term, with 
—wo + w in the denominator, and the rest is the same as before.) But when you 
stop to think of it, this is an absolutely astonishing result: If the particle is in the 
upper state, and you shine light on it, it can make a transition to the /ower state, 
and in fact the probability of such a transition is exactly the same as for a transition 
upward from the lower state. This process, which was first predicted by Einstein, 
is called stimulated emission. 


b —————2——— b b 
SNS SY 
SINI SILLY ININ IN aS. LV LVILnTS 
a a a 
(a) Absorption (b) Stimulated emission (c) Spontaneous emission 


FIGURE 9.4: Three ways in which light interacts with atoms: (a) absorption, 
(b) stimulated emission, (c) spontaneous emission. 
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In the case of stimulated emission the electromagnetic field gains energy hwo 
from the atom; we say that one photon went in and two photons came out—the 
original one that caused the transition plus another one from the transition itself 
(Figure 9.4(b)). This raises the possibility of amplification, for if I had a bottle 
of atoms, all in the upper state, and triggered it with a single incident photon, a 
chain reaction would occur, with the first photon producing 2, these 2 producing 
4, and so on. We'd have an enormous number of photons coming out, all with the 
same frequency and at virtually the same instant. This is, of course, the principle 
behind the laser (light amplification by stimulated emission of radiation). Note that 
it is essential (for laser action) to get a majority of the atoms into the upper state 
(a so-called population inversion), because absorption (which costs one photon) 
competes with stimulated emission (which produces one); if you started with an 
even mixture of the two states, you'd get no amplification at all. 

There is a third mechanism (in addition to absorption and stimulated emission) 
by which radiation interacts with matter; it is called spontaneous emission. Here an 
atom in the excited state makes a transition downward, with the release of a photon, 
but without any applied electromagnetic field to initiate the process (Figure 9.4(c)). 
This is the mechanism that accounts for the typical decay of an atomic excited 
state. At first sight it is far from clear why spontaneous emission should occur 
at all. If the atom is in a stationary state (albeit an excited one), and there is no 
external perturbation, it should just sit there forever. And so it would, if it were 
really free of all external perturbations. However, in quantum electrodynamics the 
fields are nonzero even in the ground state —just as the harmonic oscillator (for 
example) has nonzero energy (to wit: fiw/2) in its ground state. You can turn out 
all the lights, and cool the room down to absolute zero, but there is still some 
electromagnetic radiation present, and it is this "zero point" radiation that serves. 
to catalyze spontaneous emission. When you come right down to it, there is really 
no such thing as truly spontaneous emission; it's all stimulated emission. The only 
distinction to be made is whether the field that does the stimulating is one that you 
put there, or one that God put there. In this sense it is exactly the reverse of the 
classical radiative process, in which it's all spontaneous, and there is no such thing 
as stimulated emission. 

Quantum electrodynamics is beyond the scope of this book,’ but there is a 
lovely argument, due to Einstein,? which interrelates the three processes (absorp- 
tion, stimulated emission, and spontaneous emission). Einstein did not identify 
the mechanism responsible for spontaneous emission (perturbation by the ground- 
state electromagnetic field), but his results nevertheless enable us to calculate the 


"For an accessible treatment see Rodney Loudon, The Quantum Theory of Light, 2nd ed. (Claren- 
don Press, Oxford. 1983). 


8Einstein's paper was published in 1917, well before the Schródinger equation. Quantum elec- 
trodynantics comes into the argument via the Planck blackbody formula (Equation 5.113). which dates 
from 1900. 
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spontaneous emission rate, and from that the natural lifetime of an excited atomic 
state.” Before we turn to that, however, we necd to consider the response of an atom 
to non-monochromatic, unpolarized, incoherent electromagnetic waves coming in 
from all directions—such as it would encounter, for instance, if it were immersed 
in thermal radiation. 


9.2.3 Incoherent Perturbations 


The energy density in an electromagnetic wave is!° 
€ 
u= Ej. [9.37] 


where Ep is (as before) the amplitude of the electric field. So the transition prob- 
ability (Equation 9.36) is (not surprisingly) proportional to the energy density of 
the fields: à 

2u asin [(wo — w)t/2] 


Py-sa(t) = zlzl 
€oh- 


3 
(«9 — w)? [3.38] 


But this is for a monochromatic wave. at a single frequency w. In many applica- 
tions the system is exposed to electromagnetic waves at a whole range of fre- 
quencies; in that case u — p(w)dw, where p(w)dw is the energy density in 
the a) range dw, and the net transition probability takes the form of an 
integral: 


dw. [9.39] 


oo . 2 
Pis) = lal? f p d TE 


eoh? (wo — w)? 


U : A : a AE ; a H - . , 
? For an interesting alternative derivation using "seat-o[-the-pants" quantum electrodynamics, see 
Problem 9.9. 


IOD, Griffiths. Introduction to Electrodynamics, 3rd ed. (Prentice Hall. Upper Saddle River. NJ. 
1999), Section 9.2.3. In general, the energy per unit volume in electromagnetic fields is 


? P ? 
u = (eo/2) E^ + (1/249) Bo. 
For electromagnetic waves, the electric and magnetic contributions are equal, so 
7 2 2 
u= eo E? = é) Ej Cos” (wt). 


and the average over a full cycle is (€9/2) Eg. since the average ol cos? (or sin?) is 1/2. 

ll Equation 9,39 assumes that the perturbations at different frequencies are independent, so that 
the total transition probability is a sum of the individual probabilities. If the different components 
are coherent (phase-correlated). then we should add amplitudes (cj (t)). not probabilities (\cy(t 17). 
and there will be cross-terms. For the applications we will consider the perturbations are always 
incoherent. 
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The term in curly brackets is sharply peaked about wo (Figure 9.2), whereas 
p(w) is ordinarily quite broad, so we may as well replace p(w) by o(wo), and take 
it outside the integral: 


Ael oy s sin? [fo — 21/2] 
( 


Phoalt) = dw. [9.40] 


eo? (wo — w)? 


Changing variables to x = (wy — w)t/2, extending the limits of integration to 
x = too (since the integrand is essentially zero out there anyway), and looking 
up the definite integral 


© sin? x , 
"o dx — z. 19.41] 
—OQ P 
we find 3 
rial 
Pp—va(t) = ^ p(oo)t. [9.42] 
T 


This time the transition probability is proportional to t. The bizarre “flopping” 
phenomenon characteristic of a monochromatic perturbation gets "washed out" 
when we hit the system with an incoherent spread of frequencies. In particular, the 
transition rate (R = dP/dt) is now a constant: 


IT 
Rba = — cp pwo). [9.43] 
eoh* 


So far, we have assumed that the perturbing wave is coming in along the v 
direction (Figure 9.3), and polarized in the z direction. But we are interested in 
the case of an atom bathed in radiation coming from all directions, and with all 
possible polarizations; the energy in the fields (o(«)) is shared equally among these 
different modes. What we need, in place of |z|, is the average of |p - A|?, where 


p= qilriva) [9.44] 


(generalizing Equation 9.33), and the average is over all polarizations and all inci- 
dent directions. 

The averaging can be carried out as follows: Choose spherical coordinates 
such that the direction of propagation (Å) is along x, the polarization (ñ) is along 
z. and the vector f» defines the spherical angles 0 and $ (Figure 9.5). ^ (Actually, 


ITI reat g as though it were real. even though in general it will be complex. Since 


|p nl? = |Re(p)-n+iIm(p)- al = |Re(p)- al? + |Im(p) nl? 
we can do the whole calculation for the real and imaginary parts separately. and simply add the results. 


In Equation 9.47 the absolute value sign is doing double duty. signifying both the vector magnitude 
and to the complex amplitude: 


lal = ps + ley + lec. 
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FIGURE 9.5: Axes for the averaging of |p - |. 


f. is fixed here, and we're averaging over all Å and ji consistent with ÅL — which 
is to say, over all à and @. But it's really the coordinate system, not the vector p, 
that is changing.) Then 

p-n=pcosé, [9.45] 


and 


2 l jo 
Ip - Flare - | laos 0 sin0 d6 do 


lel ( cos? 9 X |" NE 
- - 2x) = =|pl. 46 
"s 3 ; (27) zla] [9.46] 


Conclusion: The transition rate for stimulated emission from state b to state a, 
under the influence of incoherent, unpolarized light incident from all directions, is 


zt 


Rha = 121 o (coo). [9.47] 


3egh? 


where 42 is the matrix element of the electric dipole moment between the two states 
(Equation 9.44), and o(«o) is the energy density in the fields, per unit frequency, 
evaluated at w = (Ep — E4)/h.? 


l*This is a special case of Fermi's Golden Rule for time-dependent perturbation theory. which 
says that the transition rate is proportional to the square of the matrix element of the perturbing potential 
and to the strength of the perturbation at the transition frequency. 
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9.3 SPONTANEOUS EMISSION 


9.3.1 Einstein's A and B Coefficients 


Picture a container of atoms, N4 of them in the lower state (Ya), and Nj of them in 
the upper state (Yp). Let A be the spontaneous emission rate,!* so that the number 
of particles leaving the upper state by this process, per unit time, is N)A.!> The 
transition rate for stimulated emission, as we have seen (Equation 9.47), is propor- 
tional to the energy density of the electromagnetic field: Bj, p (œ); the number of 
particles leaving the upper state by this mechanism, per unit time, is Nj Bpa p (%0). 
The absorption rate is likewise proportional to o(wo)—call it Bapo (wo); the num- 
ber of particles per unit time joining the upper level is therefore Ny Bap p (wo). All 


told, then, 
dN, 


dt 
Suppose these atoms are in thermal equilibrium with the ambient field, so that 


the number of particles in each level is constant. In that case d Np /dt = 0, and it 
follows that 


= —N A — Np Bha p(c0) + Na Bap p(wo). [9.48] 


A 


—— [9.49] 
(Na/ Np) Bab e Boa 


p(w) = 


On the other hand, we know from elementary statistical mechanics!? that the 
number of particles with energy E, in thermal equilibrium at temperature T, is 
proportional to the Boltzmann factor, exp( — E/ kg T), so 

Na e EalkaT 


n> eee [9.50] 
b nk 


and hence 
A 


—— 9.51 
ghoo/ kg 1 Bap — Bpa | | 


p (wo) = 


But Planck's blackbody formula (Equation 5.113) tells us the energy density 
of thermal radiation: 


| 1A 7» 5 
Comparing the two expressions, we conclude that 
Bab = Boa [9.53] 


HNormally I'd use R for a transition rate, but out of deference to der Alte everyone follows 
Einstein's notation in this context. 

15 Assume that Np and Nj are very large, so we can treat them as continuous functions of time 
and ignore statistical fluctuations. 

lóSee. for example. Charles Kittel and Herbert Kroemer. Thermal Physics. 2nd ed. (Freeman. 
New York. 1980), Chapter 3. 
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and 


ma Boa. [9.54] 


Equation 9.53 confirms what we already knew: The transition rate for stimu- 
lated emission is the same as for absorption. But it was an astonishing result 
in 1917—-indeed, Einstein was forced to "invent" stimulated emission in order 
to reproduce Planck's formula. Our present attention, however, focuses on 
Equation 9.54, for this tells us the spontaneous emission rate ( A)— which is what 
we are looking for—in terms of the stimulated emission rate (Bpa 0(w0))— which 
we already know. From Equation 9.47 we read off 


Tm 


2 
e 9.55 
Reh? lel. [9.55] 


Boa = 


and it follows that the spontaneous emission rate is 


3 2 
— lel 


= ———á. 9.56 
3z eghic? [2:36] 


Problem 9.8 As a mechanism for downward transitions, spontaneous emission 
competes with thermally stimulated emission (stimulated emission for which black- 
body radiation is the source). Show that at room temperature (T = 300 K) thermal 
stimulation dominates for frequencies well below 5 x 10!? Hz, whereas spontaneous 
emission dominates for frequencies well above 5 x 10!* Hz. Which mechanism 
dominates for visible light? 


Problem 9.9 You could derive the spontaneous emission rate (Equation 9.56) with- 
out the detour through Einstein's A and B coefficients if you knew the ground-state 
energy density of the electromagnetic field, oo(«), for then it would simply be a 
case of stimulated emission (Equation 9.47). To do this honestly would require 
quantum electrodynamics, but if you are prepared to believe that the ground state 
consists of one photon in each mode, then the derivation is very simple: 


(a) Replace Equation 5.111 by N,, = dg, and deduce pp(w). (Presumably this 
formula breaks down at high frequency, else the total *vacuum energy" would 
be infinite ... but that's a story for a different day.) 


(b) Use your result, together with Equation 9.47, to obtain the spontaneous emis- 
sion rate. Compare Equation 9.56. 
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9.3.2 The Lifetime of an Excited State 


Equation 9.56 is our fundamental result; it gives the transition rate for spontaneous 
emission. Suppose, now, that you have somehow pumped a large number of atoms 
into the excited state. As a result of spontaneous emission, this number will decrease 
as time goes on; specifically, in a time interval dt you will lose a fraction A dt 
of them: 

d Ny = —AN>pat, [9.57] 


(assuming there is no mechanism to replenish the supply).!? Solving for Np(t), 
we find: 
Ni (t) = Ny(0)e ^; [9.58] 


evidently the number remaining in the excited state decreases exponentially, with 
a time constant 


p [9.59] 
We call this the lifetime of the state—technically, it is the time it takes for N, (t) 
to reach 1/e z 0.368 of its initial value. 

I have assumed all along that there are only two states for the system, 
but this was just for notational simplicity —the spontaneous emission formula 
(Equation 9.56) gives the transition rate for y; — Wa regardless of what other 
states may be accessible (see Problem 9.15). Typically, an excited atom has many 
different decay modes (that is: Y, can decay to a large number of different lower- 
energy states, Waj, Vas. Wa,.--.). In that case the transition rates add, and the net 


lifetime is 


T= ———————. 
Ay Ag AS eee 


[9.60] 


Example 9.1 Suppose a charge q is attached to a spring and constrained to oscil- 
late along the x-axis. Say it starts out in the state |”) (Equation 2.61), and decays 
by spontaneous emission to state |’). From Equation 9.44 we have 


g—aq(mn|xin'y. 


You calculated the matrix elements of x back in Problem 3.33: 


| h 
(nix |n) = Suo (Vw Ón 1 + Vit By.n-1) . 


'7 This situation is not to be confused with the case of thermal equilibrium. which we considered 
in the previous section. We assume here that the atoms have been lifted out of equilibrium, and are in 
the process of cascading back down to their equilibrium levels. 
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where w is the natural frequency of the oscillator. (I no longer need this letter for 
the frequency of the stimulating radiation.) But we're talking about emission, so n' 
must be /ower than n; for our purposes, then. 


| nh ` 
=q 5 Ón'.n-.1 i. [9.61] 
Zo 


Evidently transitions occur only to states one step lower on the "ladder," and the 

frequency of the photon emitted is 

E,— Ey — (nc 1/2)ho — (n' t 1/2)ho 
ho h 

Not surprisingly, the system radiates at the classical oscillator frequency. The tran- 

sition rate (Equation 9.56) is 


w) = =(n—-n')w=w. [9.62] 


nq?o* 
Eran LE 9.63 
6z egmc? ` [9.63] 
and the lifetime of the nth stationary state is 
6r eom c? 
n = — [9.64] 
ngq-w* 


Meanwhile, each radiated photon carries an energy fiw, so the power radiated is 
AR o: 


qu 


T 6x egi c? cine), 


or, since the energy of an oscillator in the mth state is E = (n + 1/2)ho, 


qo" (£ n [9.65] 
m SS — EPOD M A 
67 egm c? 2 E 


This is the average power radiated by a quantum oscillator with (initial) energy £. 

For comparison, let's determine the average power radiated by a classical 
oscillator with the same energy. According to classical electrodynamics, the power 
radiated by an accelerating charge q is given by the Larmor formula:!? 


P. 
q'a* 
P = à [9.66] 
6: eoc? 
For a harmonic oscillator with amplitude xo. x (1!) = xo cos(wr), and the acceleration 
is a = —xow* cos(wt). Averaging over a full cycle, then, 
q?xgo? 
—— Q2zeoc? 


I3See, for example. Griffiths (footnote 10). Section 11.2.1. 
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But the energy of the oscillator is E = (1 /2)mo? x3, SO a = 2E/mw”, and hence 


gw 
[9.67] 


^ neme ` 


This is the average power radiated by a classical oscillator with energy E. In the 
classical limit (ñ — 0) the classical and quantum formulas agree;!? however, 
the quantum formula (Equation 9.65) protects the ground state: If E = (1/2)hw the 
oscillator does not radiate. 


Problem 9.10 The half-life (1; /2) of an excited state is the time it would take for 
half the atoms in a large sample to make a transition. Find the relation between 
tjj? and c (the "lifetime" of the state). 


x x xProblem 9.11 Calculate the lifetime (in seconds) for each of the four n = 2 
states of hydrogen. Hint: You'll need to evaluate matrix elements of the form 
(Vio0|x |V200).. (V'i00l Y |Va1;), and so on. Remember that x = rsin@cos@, y = 
rsin@ sing, and z = r cos0. Most of these integrals are zero, so scan them before 
you start calculating. Answer: 1.60 x 107? seconds for all except W200, which 
is infinite. 


9.3.3 Selection Rules 


The calculation of spontaneous emission rates has been reduced to a matter of 
evaluating matrix elements of the form 


(Vp Ir| va). 


As you will have discovered if you worked Problem 9.11 (if you didn't, go back 
right now and do so!), these quantities are very often zero, and it would be helpful 
to know in advance when this is going to happen, so we don't waste a lot of 
time evaluating unnecessary integrals. Suppose we are interested in systems like 
hydrogen, for which the Hamiltonian is spherically symmetrical. In that case we 
may specify the states with the usual quantum numbers n, /, and m, and the matrix 
elements are 
(i lm |r|nim). 


I91n fact, if we express P in terms of the energy above the ground state, the two formulas are 
identical. 


360 Chapter 9 Time-Dependent Perturbation Theory 


Clever exploitation of the angular momentum commutation relations and the her- 
miticity of the angular momentum operators yields a set of powerful constraints on 
this quantity. 


Selection rules involving mm and im’: Consider first the commutators of L- 
with x, y, and z, which we worked out in Chapter 4 (see Equation 4.122): 
[L:..x]2 ihy, [L:.y])=—ihx, [L-,2] =0. [9.68] 
From the third of these it follows that 


0 = (n'm UL. z]|nlm) = (n'l'm'|(L-z — zL-)|nlm) 


= (nlm mhz — z(mh)]|nim) = (m' — m)h(n'l'm'|z|nim). 


Conclusion: 
Either m’ =m, or else (n'lm'|z|nlm) = 0. [9.69] 


So unless m’ = in, the matrix elements of z are always zero. 
Meanwhile, from the commutator of L- with x we get 


(lm! |L- x]Jnlin) = (wlm (L-x — x L:)|Inlin) 


= (m — m)h(n'l'm|x|nlm) = ih(n'lm!|y|nlm). 


Conclusion: 
(m — m)(n'lm!|x|nlm) = i(n'lm'|y|nlm). [9.70] 


So you never have to compute matrix elements of y —you can always get them 
from the corresponding matrix elements of x. 
Finally, the commutator of L- with y yields 


(n'l'm'|[L-. yJ|nlm) = (n lm'|(L:y — y L:)|nlm) 


= (m — m)h(n'lm'|y|nlm) = —ih(n'lm'|x|nlm). 


Conclusion: 
(m — m)(i'im!|y|nlm) = —i (n'lm|x|nim). [9.71] 


In particular, combining Equations 9.70 and 9.71, 
(n! — my (n'l m |x|nlm) = i(m' — mytn'lm'|y|lnlm) = (n'l'm |x|ntm). 
and hence: 


Either (m' — m)? — 1. or else (n'lm'|x|nIm) = (n'lm'|y|nlm) =0. — [9.72] 
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From Equations 9.69 and 9.72 we obtain the selection rule for m: 


[9.73] 


This is an easy result to understand, if you remember that the photon carries spin 
1, and hence its value of m is 1, 0, or —1;9 conservation of (the z component of) 
angular momentum requires that the atom give up whatever the photon takes away. 


Selection rules involving / and /': In Problem 9.12 you are asked to derive 
the following commutation relation: 


ps [L?. 3] = oh? (rL? + Ln), [9.74] 


As before, we sandwich this commutator between (r'/'m'| and |n/m) to derive the 
selection rule: 


(nlm |[L?. (L? v]]pndm) = 28? (n l m (L? + L?r)|ntm) 
= 2h^U( + LEU + D) Gm! |r|lndm) = (n'm (LLL. v] — £2. r] L?) nim) 
= Rm 41) + a)uoimlp.r])nm) 
=RU 3-1) — 10 + Dlm (Lr — rL?)|nim) 
= n^ (^ 4-1) — 10 EDP (im Ir|ntm). [9.75] 
Conclusion: 
Either 2[/(! + 1) - l'( + 1] =U 4-1) 210 4+ DP 
or else (n'lm'|r|nIm) = 0. [9.76] 
But 
[(" 3-1) 240 +D] = + DG — 10 
and 
2UQ +I I0 0] 2 0 LED? + 03 = 1 
so the first condition in Equation 9.76 can be written in the form 


[QU 4-14- 0? — ya! - 0? — 1] 2 0. [9.77] 


20When the polar axis is along the direction of propagation. the middle value does not occur. and 
if you are only interested in the nuniber of linearly independent photon states. the answer is 2. not 3. 
However. in this case the photon need not be going in the z direction. and all three values are possible. 
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FIGURE 9.6: Allowed decays for the first four Bohr levels in hydrogen. 


The first factor cannot be zero (unless l’ = | = 0—this loophole is closed in 
Problem 9.13), so the condition simplifies to /’ = l + 1. Thus we obtain the selec- 


tion rule for /: 
No transitions occur unless A/ = +1. [9.78] 


Again, this result (though far from trivial to derive) is easy to interpret: The 
photon carries spin 1, so the rules for addition of angular momentum would allow 
l’=1+4+ 1,1 =], or l’ = l — 1 (for electric dipole radiation the middle possi- 
bility —though permitted by conservation of angular momentum—does not occur). 


Evidently not all transitions to lower-energy states can proceed by spontaneous 
emission; some are forbidden by the selection rules. The scheme of allowed transi- 
tions for the first four Bohr levels in hydrogen is shown in Figure 9.6. Note that the 
2S state (W200) is "stuck": It cannot decay, because there is no lower-energy state 
with / = 1. It is called a metastable state, and its lifetime is indeed much longer 
than that of, for example, the 2P states (Y211, W210, and WY21—1). Metastable states 
do eventually decay, by collisions, or by what are (misleadingly) called forbidden 
transitions (Problem 9.21), or by multiphoton emission. 


xProblem 9.12 Prove the commutation relation in Equation 9.74. Hint: First show 


that 
[L?, z] = 2ih(x Ly — yLy — ihz). 

Use this, and the fact that r- L=r- (r x p) = 0, to demonstrate that 
[£^ (07, 2] = 28? (zL? + E72). 


The generalization from z to r is trivial. 
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Problem 9.13 Close the “loophole” in Equation 9.78 by showing that if l’ =/ = 0 
then (n'Im!|r|nIm) = 0. 


x *Problem 9.14 An electron in the n = 3, / = 0, m = 0 state of hydrogen decays 
by a sequence of (electric dipole) transitions to the ground state. 


(a) What decay routes are open to it? Specify them in the following way: 
p 
|300)  [nlm) — |n’I’m’) — «-» > |100}. 


(b) If you had a bottle full of atoms in this state, what fraction of them would 
decay via each route? 


(c) What is the lifetime of this state? Hint: Once it’s made the first transition, 
it’s no longer in the state |300), so only the first step in each sequence is 
relevant in computing the lifetime. When there is more than one decay route 
open, the transition rates add. 


FURTHER PROBLEMS FOR CHAPTER 9 


x xProblem 9.15 Develop time-dependent perturbation theory for a multilevel sys- 
tem, starting with the generalization of Equations 9.1 and 9.2: 


Aovn = Ey Wns (Vs oai) = Ónm- [9.79] 


At time t = 0 we turn on a perturbation H’(r), so that the total Hamiltonian is 
H = Ho + H'(t). [9.80] 


(a) Generalize Equation 9.6 to read 


W(t) = Y enpe E, [9.81] 
and show that : 
; I (Ey, — 
im = —. a en [9.82] 
n 
where 
H} = (Yml H’ IYn). [9.83] 


(b) If the system starts out in the state Yy, show that (in first-order perturbation 
theory) 


- pt 
ivi S es Í Hi y(t’) dt. [9.84] 
h Jo 
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and 
B H : : 
cw (t) = EE Í Hi y ()e En ENY Ph dt. (mN). [9.85] 


(c) For example. suppose H’ is constant (except that it was turned on at t = 0, 
and switched off again at some later time 7). Find the probability of transition 
from state N to state M (M Æ N), as a function of t. Answer: 

- 2 
sin£[(Ey — Ey )t/2h 
4| Hu eves n J [9.86] 
(En — Eu) 

(d) Now suppose H’ is a sinusoidal function of time: H’ = V cos(wt). Making 
the usual assumptions, show that transitions occur only to states with energy 
Em = En X ho, and the transition probability is 

Sin [(Ey — Em t ho)t/2h] 

(Ey — Em X hio? 

(e) Suppose a multilevel system is immersed in incoherent electromagnetic radi- 

ation. Using Section 9.2.3 as a guide. show that the transition rate for stim- 


ulated emission is given by the same formula (Equation 9.47) as for a two- 
level system. 


Py M = |Vunl [9.87] 


Problem 9.16 For the examples in Problem 9.15(c) and (d), calculate c(t), to 
first order. Check the normalization condition: 


Do lem OP = 1, [9.88] 
m 
and comment on any discrepancy. Suppose you wanted to calculate the probability 
of remaining in the original state Vw; would you do better to use lev (f^, or 


pe Snan lem(t)|?7? 


Problem 9.17 A particle starts out (at time ! = 0) in the Nth state of the infinite 
square well. Now the “floor” of the well rises temporarily (maybe water leaks 
in, and then drains out again), so that the potential inside is uniform but time 
dependent: Vo(t), with Vo(0) = Vo(T) = 0. 


(a) Solve for the exact c,,(t), using Equation 9.82, and show that the wave 
function changes phase, but no transitions occur. Find the phase change, 
Q (T), in terms of the function Vo(t). 


(b) Analyze the same problem in first-order perturbation theory, and compare 
your answers. 


Comment: The same result holds whenever the perturbation simply adds a 
constant (constant in x, that is, not in 7) to the potential; it has nothing to do with 
the infinite square well, as such. Compare Problem 1.8. 
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«Problem 9.18 A particle of mass m is initially in the ground state of the (one- 
dimensional) infinite square well. At time t = 0 a “brick” is dropped into the well, 
so that the potential becomes 

Vo. if 0 < X < a/2, 
V(x) = 40, ifa/2<x <a. 
co, otherwise, 
where Vo « Ej. After a time 7, the brick is removed, and the energy of the 


particle is measured. Find the probability (in first-order perturbation theory) that 
the result is now £5. 


Problem 9.19 We have encountered stimulated emission, (stimulated) absorption, 
and spontaneous emission. How come there is no such thing as spontaneous absorp- 
tion? 


* x xProblem 9.20 Magnetic resonance. A spin-1/2 particle with gyromagnetic ratio 


y, at rest in a static magnetic field Bok, precesses at the Larmor frequency wo = 
y By (Example 4.3). Now we turn on a small transverse radiofrequency (rf) field, 
Bys[cos(wt) ? — sin(wt) j), so that the total field is 


B = Br cos(wt) î — By sin(wt) f + Bok. [9.89] 
(a) Construct the 2 x 2 Hamiltonian matrix (Equation 4.158) for this system. 


(b) If x(t) = te is the spin state at time f, show that 


a= z (Qep + woa ) | b= ; (Qe "ta — wob) . [9.90] 


where Q = y B, is related to the strength of the rf field. 


(c) Check that the general solution for a(t) and b(t), in terms of their initial 
values ao and bg, is 


a(t) — | ag cos(cw't /2) + -< [ao(co — w) + boQ] sin(w't m} grae 
b(t) = L cos(w't /2) + a [bo (c) — wo) + ag€2] 5ine1/2] p ot 
wW 


where 


w = V (w — wp)? + R. [9.91] 
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(d) If the particle starts out with spin up (ie. ag = 1, bọ = 0), find the 
probability of a transition to spin down, as a function of time. Answer: 
P(t) = {27/[(w — wo)? + 27)} sin? (w't/2). 


(e) Sketch the resonance curve, 


Q2 


e oa 


[9.92] 
as a function of the driving frequency w (for fixed wo and $2). Note that the 
maximum occurs at w = wy. Find the “full width at half maximum,” Aw. 


(f) Since wọ = y Bo, we can use the experimentally observed resonance to deter- 
mine the magnetic dipole moment of the particle. In a nuclear magnetic res- 
onance (nmr) experiment the g-factor of the proton is to be measured, using 
a Static field of 10,000 gauss and an rf field of amplitude 0.01 gauss. What 
will the resonant frequency be? (See Section 6.5 for the magnetic moment 
of the proton.) Find the width of the resonance curve. (Give your answers 
in Hz.) 


x x xProblem 9.21 In Equation 9.31 I assumed that the atom is so small (in comparison 


to the wavelength of light) that spatial variations in the field can be ignored. The 
true electric field would be 


E(r. t) = Eo cos(k  r — or). [9.93] 


If the atom is centered at the origin, then k-r < 1 over the relevant volume 
(|k| = 27/A, so k-r ~ r/X « 1), and that's why we could afford to drop this 
term. Suppose we keep the first-order correction: 


K(r, t) = Eo[cos(wr) + (k -r) sin(wt)]. [9.94] 


The first term gives rise to the allowed (electric dipole) transitions we considered 
in the text; the second leads to so-called forbidden (magnetic dipole and electric 
quadrupole) transitions (higher powers of k-r lead to even more “forbidden” 
transitions, associated with higher multipole moments).?! 


(a) Obtain the spontaneous emission rate for forbidden transitions (don’t bother 
to average over polarization and propagation directions, though this should 
really be done to complete the calculation). Answer: 


2753 
Rpa = —— llall ry rb. [9.95] 
z eghic? 


?! For a systematic treatment (including the role of the magnetic field) see David Park, Introduction 
to the Quantum Theory, 3rd ed. (McGraw-Hill, New York. 1992), Chapter 11. 
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(b) Show that for a one-dimensional oscillator the forbidden transitions go from 
level n to level n — 2, and the transition rate (suitably averaged over ñ 
and k) is 

hq?o?n (n — 1) 
R = —————r. [9.96] 
15z egm? c? 
(Note: Here c is the frequency of the photon, not the oscillator.) Find the 
ratio of the "forbidden" rate to the “allowed” rate, and comment on the 
terminology. 


(c) Show that the 25 — 1S transition in hydrogen is not possible even by a 
"forbidden" transition. (As it turns out, this is true for all the higher multipoles 
as well; the dominant decay is in fact by two-photon emission, and the lifetime 
is about a tenth of a second?) 


x x xProblem 9.22 Show that the spontaneous emission rate (Equation 9.56) for a tran- 
sition from n,/ to n'. l’ in hydrogen is 


I+] 
qm pue at 

gag Spa in 

EE x [9.97] 

3x eohic? l eee 
a ee S 

where a 
T f 3 Ry) Ryp(r) dr. [9.98] 
0 


(The atom starts out with a specific value of m, and it goes to any of the states mm’ 
consistent with the selection rules: m’ = m + 1. m. or m — 1. Notice that the answer 
is independent of m.) Hint: First calculate all the nonzero matrix elements of x, v, 
and z between |n/m) and |n'i'm^) for the case /’ = / + 1. From these, determine 
the quantity 


Hn EE 1. m + lrini? + (i. 14-3 mlrinim) + n+ Loam — Hral p. 


Then do the same for /’ — | — 1. 


*2See Masataka Mizushima, Quantum Mechanics of Atomic Spectra and Atomic Structure, Ben- 
jamin, New York (1970). Section 5.6. 


CHAPTER 10 


THE ADIABATIC APPROXIMATION 


10.1 THE ADIABATIC THEOREM 


10.1.1 Adiabatic Processes 


Imagine a perfect pendulum, with no friction or air resistance, oscillating back and 
forth in a vertical plane. If you grab the support and shake it in a jerky manner the 
bob will swing around chaotically. But if you very gently and steadily move the 
support (Figure 10.1), the pendulum will continue to swing in a nice smooth way, 
in the same plane (or one parallel to it), with the same amplitude. This gradual 
change of the external conditions defines an adiabatic process. Notice that there are 
two characteristic times involved: 7;, the “internal” time, representing the motion 
of the system itself (in this case the period of the pendulum's oscillations), and 7;, 
the "external" time, over which the parameters of the system change appreciably (if 
the pendulum were mounted on a vibrating platform, for example, 7, would be the 
period of the platform's motion). An adiabatic process is one for which T, >> 7;.! 

The basic strategy for analyzing an adiabatic process is first to solve the 
problem with the external parameters held constant, and only at the end of the 
calculation allow them to vary (slowly) with time. For example, the classical period 
of a pendulum of (fixed) length L is 27./L/g: if the length is now gradually 
changing, the period will presumably be 27./L(t)/g. A more subtle example 
occurred in our discussion of the hydrogen molecule ion (Section 7.3). We began 


lFor an interesting discussion of classical adiabatic processes. see Frank S. Crawford. Am. J. 
Phys. 58, 337 (1990). 
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FIGURE 10.1: Adiabatic motion: If the case is 
transported very gradually, the pendulum inside keeps 
swinging with the same amplitude, in a plane parallel 
to the original one. 


by assuming that the nuclei were at rest, a fixed distance R apart, and we solved 
for the motion of the electron. Once we had found the ground state energy of 
the system as a function of R, we located the equilibrium separation and from 
the curvature of the graph we obtained the frequency of vibration of the nuclei 
(Problem 7.10). In molecular physics this technique (beginning with nuclei at rest, 
calculating electronic wave functions, and using these to obtain information about 
the positions and—relatively sluggish— motion of the nuclei) is known as the 
Born-Oppenheimer approximation. 

In quantum mechanics, the essential content of the adiabatic approximation 
can be cast in the form of a theorem. Suppose the Hamiltonian changes gradually 
from some initial form H! to some final form H”. The adiabatic theorem states 
that if the particle was initially in the nth eigenstate of H', it will be carried 
(under the Schrédinger equation) into the nth eigenstate of H f. (1 assume that the 
spectrum is discrete and nondegenerate throughout the transition from H’ to H f 
so there is no ambiguity about the ordering of the states; these conditions can be 
relaxed, given a suitable procedure for “tracking” the eigenfunctions, but I’m not 
going to pursue that here.) 

For example, suppose we prepare a particle in the ground state of the infinite 
square well (Figure 10.2(a)): 


vi(x) = [Asin (=x) [10.1] 


If we now gradually move the right wall out to 2a, the adiabatic theorem says that 
the particle will end up in the ground state of the expanded well (Figure 10.2(b)): 


: 1 T 
Fazio 
yt (x)= , sin (57x) : [10.2] 
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y'(x) w(x) w(x) 


a 2ax a 2a x a 2a x 


(a) (b) (c) 


FIGURE 10.2: (a) Particle starts out in the ground state of the infinite square well. (b) 
If the wall moves slowly, the particle remains in the ground state. (c) If the wall moves 
rapidly, the particle is left (momentarily) in its initial state. 


(apart, perhaps, from a phase factor). Notice that we're not talking about a small 
change in the Hamiltonian (as in perturbation theory)—this one is Auge. All we 
require is that it happen slowly. Energy is not conserved here: Whoever is moving 
the wall is extracting energy from the system, just like the piston on a slowly 
expanding cylinder of gas. By contrast, if the well expands suddenly, the resulting 
state is still w(x) (Figure 10.2(c)), which is a complicated linear combination of 
eigenstates of the new Hamiltonian (Problem 2.38). In this case energy is conserved 
(at least, its expectation value is); as in the free expansion of a gas (into a vacuum) 
when the barrier is suddenly removed, no work is done. 


* x xProblem 10.1 The case of an infinite square well whose right wall expands at a 
constant velocity (v) can be solved exactly.? A complete set of solutions is 


|2 . (nz (muy? 2 Ei 
o, CY, i= u sin (=) geen 2E,at)/2hw a [10.3] 


where w(t) = a + vt is the (instantaneous) width of the well and E] = 
n?z?h? /2ma? is the nth allowed energy of the original well (width a). The general 
solution is a linear combination of the d's: 


oc 
P.D = 3 eeu. D: [10.4] 
n=! 
the coefficients c, are independent of t. 


(a) Check that Equation 10.3 satisfies the time-dependent Schródinger equation, 
with the appropriate boundary conditions. 


*§. W. Doescher and M. H. Rice. Am. J. Phys. 37, 1246 (1969). 
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(b) Suppose a particle starts out (t = 0) in the ground state of the initial well: 


PES 
vV(x.0) = |/-—sin (=x) : 
a a 
Show that the expansion coefficients can be written in the form 


2 d p 
£j = | e !** sin(nz) sin(z) dz. [10.5] 
7T Jo 


where o = mva/27?h is a dimensionless measure of the speed with which 
the well expands. (Unfortunately, this integral cannot be evaluated in terms 
of elementary functions.) 


(c) Suppose we allow the well to expand to twice its original width, so the 
"external" time is given by w(T;) = 2a. The "internal" time is the period of 
the time-dependent exponential factor in the (initial) ground state. Determine 
T, and T;, and show that the adiabatic regime corresponds to a « 1, so that 
exp(—iaz?) = ] over the domain of integration. Use this to determine the 
expansion coefficients, c,. Construct V (x. f), and confirm that it is consistent 
with the adiabatic theorem. 


(d) Show that the phase factor in W(x. t) can be written in the form 
1 t 
0(t) = -; f E\(t’) dt’, [10.6] 
0 


where E,(t) = n?n?h? |[2mw? is the instantaneous eigenvalue, at time f. 
Comment on this result. 


10.1.2 Proof of the Adiabatic Theorem 


The adiabatic theorem is simple to state, and it sounds plausible, but it is not easy 
to prove.? If the Hamiltonian is independent of time, then a particle which starts 
out in the mth eigenstate,* Wns 


H Yn = En Wn. [10.7] 


The theorem is usually attributed to Ehrenfest, who studied adiabatic processes in early versions 
of the quantum theory. The first proof in modern quantum mechanics was given by Born and Fock, 
Zeit. f. Physik 51, 165 (1928). Other proofs will be found in Messiah, Quantum Mechanics, Wiley. New 
York (1962). Vol. lI. Chapter XVII. Section 12, J-T Hwang and Philip Pechukas. J. Chem. Phys. 67. 
4640. 1977. and Gasiorowicz, Quantum Physics. Wiley. New York (1974). Chapter 22. Problem 6. The 
argument given here follows B. H. Bransden and C. J. Joachain. Introduction to Quantum Mechanics, 
2nd cd., Addison-Wesley. Boston. MA (2000). Section 9.4. 


H11 suppress the dependence on position (or spin. etc.): in this argument only the time depen- 
dence is at issue. 
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remains in the mth eigenstate, simply picking up a phase factor: 
P, (ft) = we Teel, [10.8] 


If the Hamiltonian changes with time, then the eigenfunctions and eigenvalues are 
themselves time-dependent: 


H (t) v, (t) = En(t)Wn(t). [10.9] 
but they still constitute (at any particular instant) an orthonormal set 


(Vn (Dm (0)) = nn. [10.10] 


and they are complete, so the general solution to the time-dependent Schrödinger 
equation 


ih wn = H(t) (t) [10.11] 


can be expressed as a linear combination of them: 


W(t) = Denne, [10.12] 


n 


where e 
an=- | E,, (t’) dt’ [10.13] 
0 


1 


generalizes the "standard" phase factor to the case where E, varies with time. 
(As usual, I could have included it in the coefficient c, (r), but it is convenient to 
factor out this portion of the time dependence, since it would be present even for 
a time-in dependent Hamiltonian.) 

Substituting Equation 10.12 into Equation 10.11 we obtain 


ih > [en Vn + €n Vn T icu WnOn | eem » Cn (Hwy )e9 [10.14] 


n n 


(I use a dot to denote the time derivative). In view of Equations 10.9 and 10.13 
the last two terms cancel, leaving 


5 Cn yy elo LES Nx Cn Vel, [10.15] 
n n 


Taking the inner product with V,,, and invoking the orthonormality of the instan- 
taneous eigenfunctions (Equation 10.10), 


] : t 0 
25 CnOmn€ Oy = — >D Cn (Win Wn de nd 
n n 
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or 


€ (f) = — Yo en (Wl Wine mak [10.16] 
n 


Now, differentiating Equation 10.9 with respect to time yields 
Hy, si Hy, = Èn Wn + EnWn. 


and hence (again taking the inner product with Wm) 


(Wml Elyn) + (Wm LH lin) = Endmn + En lm dn): [10.17] 


Exploiting the hermiticity of H to write (Y| H hin) = En Iin), it follows that 
for n #m 
(Wml H |y) = (En = Em) Qm Yn). [10.18] 


Putting this into Equation 10.16 (and assuming, remember, that the energies are 
nondegenerate) we conclude that 


id |H |j) s: 1 x Bre: ' , 
Cm (t)= = —Cm (m Vra) — ) Cy L——————- m E ! e! Ph) Jol Es) Ey (t )]di , [10.19] 
T em 
nm 


This result is exact. Now comes the adiabatic approximation: Assume that H 
is extremely small, and drop the second term,’ leaving 


m(t) = —Cm (Wn him). [10.20] 


with the solution 
Cm (t) = Cm (0) Yo [10.21] 


Yn (t) = if UE )|- 


In particular, if the particle starts out in the mth eigenstate (which is to say, if 
c, (0) = 1, and cm (0) = 0 for m Æ n), then (Equation 10.12) 


where? 


ay Vn (t ») ar. [10.22] 


[10.23] 


so it remains in the mth eigenstate (of the evolving Hamiltonian), picking up only 
a couple of phase factors. QED 


Rigorous justification of this step is not trivial. Sec A. C. Aguiar Pinto er al.. Am. J. Phys. 68. 
955 (2000). 


. ÓNotice that y is real. since the normalization of y, entails (dd/dt) Ym |Win) = (nlla) + 
(Wan Win) = 2Re (Win va) = 0. 
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Example 10.1 Imagine an electron (charge —e, mass m) at rest at the origin, 
in the presence of a magnetic field whose magnitude (Bo) is constant, but whose 
direction sweeps out a cone, of opening angle œ, at constant angular velocity w 
(Figure 10.3): 


B(t) = Bolsin æ cos(wt)i + sina sin(wr)j + cos ak]. [10.24] 


The Hamiltonian (Equation 4.158) is 


e ehBpo. . : ; 
H(t)=—B-S= 2 [sin æ cos(wr)o, + sino sin(wf)o, + cos ao-] 
m n i 
ha cosa ee! sing 
7 pe sina —cosa ) [10.25] 
where T 
ppm. [10.26] 


m 


The normalized eigenspinors of H (t) are 


x07 ( cos (ure) |: [10.27] 


el^! sin(a/2) 
and 
— cos(a/2) 


x-()- ae sin(a/ 3 [10.28] 


they represent spin up and spin down, respectively, along the instantaneous direc- 
tion of B(t) (see Problem 4.30). The corresponding eigenvalues are 


[10.29] 


FIGURE 10.3: Magnetic field sweeps around in a 
cone, at angular velocity w (Equation 10.24). 
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Suppose the electron starts out with spin up, along B(0):’ 


_ {cos(a/2) 
x (0) = ( Suo A ; [10.30] 


The exact solution to the time-dependent Schrödinger equation is (Problem 10.2): 
[cosas /2) — i LZP sin(ar/2) | costo ein? 


x) = . [1031] 
[eos(ar/2) = jit) sin(At /2)| sin(a/2)e*!*'? 


A= jw + wy — 20a COS Ql. [10.32] 


Or, expressing it as a linear combination of x+} and x-_: 


X(t) = feos (=) — oe — oem sin (>) e Py D) 


At 
+i E sin o sin @) etiot/2y_ (p), [10.33] 


where 


Evidently the (exact) probability of a transition to spin down (along the current 
direction of B) is 


2 
Mx@lx-()2 = [Esinsin (=)] | [10.34] 


The adiabatic theorem says that this transition probability should vanish in the limit 
T, >> Ti, where T, is the characteristic time for changes in the Hamiltonian (in this 
case, 1/w) and T; is the characteristic time for changes in the wave function (in this 
case, h/(E,. — E_) = 1/o1). Thus the adiabatic approximation means w « «oj: 
The field rotates slowly, in comparison with the phase of the (unperturbed) wave 
functions. In the adiabatic regime 4 = w4, and therefore 


2 


lax XOlx- (s |= sino sin (5)] — 0. [10.35] 


as advertised. The magnetic field leads the electron around by its nose, with the 
spin always pointing in the direction of B. By contrast, if w >> «| then A = c, and 
the system bounces back and forth between spin up and spin down (Figure 10.4). 


"This is essentially the same as Problem 9.20. except that now the electron starts out with spin 
up along B, whereas in Equation 9.20(d) it started out with spin up along z. 
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COUNT 


2n/À 4n/X 6x/A Br/r t 


FIGURE 10.4: Plot of the transition probability, Equation 10.34, in the nomadiabatic 
regime (w > w1). 


x xProblem 10.2 Check that Equation 10.31 satisfies the time-dependent Schrödinger 


equation for the Hamiltonian in Equation 10.25. Also confirm Equation 10.33, and 
show that the sum of the squares of the coefficients is 1, as required for normalization. 


10.2 BERRY'S PHASE 


10.2.1 Nonholonomic Processes 


Let's go back to the classical model I used (in Section 10.1.1) to develop the notion 
of an adiabatic process: a perfectly frictionless pendulum, whose support is carried 
around from place to place. I claimed that as long as the motion of the support is 
very slow, compared to the period of the pendulum (so that the pendulum executes 
many oscillations before the support has moved appreciably), it will continue to 
swing in the same plane (or one parallel to it), with the same amplitude (and, of 
course, the same frequency). 

But what if I took this ideal pendulum up to the North Pole, and set it swing- 
ing— say, in the direction of Portland (Figure 10.5). For the moment, pretend the 
earth is not rotating. Very gently (that is, adiabatically), I carry it down the lon- 
gitude line passing through Portland, to the equator. At this point it is swinging 
north-south. Now I carry it (still swinging north-south) part way around the equa- 
tor. And finally, I take it back up to the North Pole, along the new longitude line. 
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FIGURE 10.5: Itinerary for adiabatic transport of 
a pendulum on the surface of the earth. 


It is clear that the pendulum will no longer be swinging in the same plane as it 
was when I set out—indeed, the new plane makes an angle © with the old one, 
where © is the angle between the southbound and the northbound longitude lines. 

As it happens, © is equal to the solid angle (Q) subtended (at the center of the 
earth) by the path around which I carried the pendulum. For this path surrounds a 
fraction © /27 of the northern hemisphere, so its area is A = (1/2)(O/2z)4z R? = 
GR? (where R is the radius of the earth), and hence 


O0 — A/R? =Q. [10.36] 


This is a particularly nice way to express the answer, because it turns out to be 
independent of the shape of the path (Figure 10.6). 

Incidentally, the Foucault pendulum is an example of precisely this sort of 
adiabatic transport around a closed loop on a sphere— only this time instead of me 


FIGURE 10.6: Arbitrary path on the surface of a 
sphere, subtending a solid angle &. 


? You can prove this for yourself. if you are interested. Think of the circuit as being made up of 
tiny segments of great circles (geodesies on the sphere): the pendulum makes a fixed angle with each 
geodesic segment, so the net angular deviation is related to the sum of the vertex angles of the spherical 
polygon. 
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Z 


ane 


FIGURE 10.7: Path of a Foucault pendulum, in 
the course of one day. 


carrying the pendulum around, I let the rotation of the earth do the job. The solid 
angle subtended by a latitude line 0) (Figure 10.7) is 


Q= IET d6d$ = 27 (— cos ey = 2z(1- cos 6p). [10.37] 


Relative to the earth (which has meanwhile turned through an angle of 27), the 
daily precession of the Foucault pendulum is 27 cos 09 —a result that is ordinarily 
obtained by appeal to Coriolis forces in the rotating reference frame,? but is seen 
in this context to admit a purely geometrical interpretation. 

A system such as this, which does not return to its original state when trans- 
ported around a closed loop, is said to be nonholonomic. (The "transport" in 
question need not involve physical motion: What we have in mind is that the 
parameters of the system are changed in some fashion that eventually returns them 
to their initial values.) Nonholonomic systems are ubiquitous—in a sense, every 
cyclical engine is a nonholonomic dévice: At the end of each cycle the car has 
moved forward a bit, or a weight has been lifted slightly, or something. The idea 
has even been applied to the locomotion of microbes in fluids at low Reynolds 
number. !Ü My project for the next section is to study the quantum mechanics of 
nonholonomic adiabatic processes. The essential question is this: How does the 
final state differ from the initial state, if the parameters in the Hamiltonian are 
carried adiabatically around some closed cycle? 


?See. for example, Jerry B. Marion and Stephen T. Thornton, Classical Dynamics of Particles 
and Systems, 4th ed.. Saunders. Fort Worth, TX (1995). Example 10.5. Geographers measure latitude 
(A) up from the equator, rather than down from the pole, so cos @p = sin À. 


IOThe pendulum example is an application of Hannay's angle, which is the classical analog to 
Berry's phase. For a collection of papers on both subjects. see Alfred Shapere and Frank Wilczek. eds., 
Geometric Phases in Physics. World Scientific. Singapore (1989). 
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10.2.2 Geometric Phase 


In Section 10.1.2 I showed that a particle which starts out in the nth eigen- 
state of H(0) remains, under adiabatic conditions, in the nth eigenstate of H (t), 
picking up only a time-dependent phase factor. Specifically, its wave function is 
(Equation 10.23) 

Wy (1) = el rum, qr). [10.38] 


where 


1 t 
a=- | E (t') dt’ [10.39] 
h Jo 


is the dynamic phase (generalizing the usual factor exp(—iE,t/h) to the case 
where E,, is a function of time), and 


msi f E 


is the so-called geometric phase. 

Now y^, (f) depends on t because there is some parameter R(t) in the Hamil- 
tonian that is changing with time. (In Problem 10.1, R(r) would be the width of 
the expanding square well.) Thus 


a IYn 3 IYn dR 


zz Valt ») dt’ [10.40] 


= 10.41 
t — 8R dt 
so 
9 ð 
min =i f (vs gat = f” (w aa) dR, [10.42] 
Ri 
where R; and Ry are the initial and final values of R(t). In particular, if the 
Hamiltonian returns to its original form after time T, so that Rf = Rj, then 


Yn (T) = O—nothing very interesting there! 

However, I assumed (in Equation 10.41) that there is only one parameter in 
the Hamiltonian that is changing. Suppose there are N of them: R(t), R2(t),..., 
Ry (t); in that case 


Ds Be Ae ELEM en 
0t — OR| dt QR dt ORyN 


dR 
= (Vr Yr): T [10.43] 


where R = (Ri, Ry. .... Ry), and Vg is the gradient with respect to these param- 
eters. This time we have 


R 
ys J Us V gia) - dR. [10.44] 
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and if the Hamiltonian returns to its original form after a time 7, the net geometric 
phase change is 


Am ai $ (pn lVR Yn) - dR. [10.45] 


This is a line integral around a closed loop in parameter-space, and it is not, in 
general, zero. Equation 10.45 was first obtained by Michael Berry, in 1984,!! and 
Yn (T) is called Berry's phase. Notice that y, (T) depends only on the path taken, 
not on how fast that path is traversed (provided, of course, that it is slow enough 
to validate the adiabatic hypothesis). By contrast, the accumulated dynamic phase, 


1 fT ; 
Oa (T) = 23! E, (t’) dt’. 
h Jo 


depends critically on the elapsed time. 

We are accustomed to thinking that the phase of the wave function is arbi- 
trary — physical quantities involve ||?, and the phase factor cancels out. For this 
reason, most people assumed until recently that the geometric phase was of no 
conceivable physical significance—after all, the phase of v, (t) itself is arbitrary. 
It was Berry's insight that if you carry the Hamiltonian around a closed loop, 
bringing it back to its. original form, the relative phase at the beginning and the 
end of the process is not arbitrary, and can actually be measured. 

For example, suppose we take a beam of particles (all in the state W), and 
split it in two, so that one beam passes through an adiabatically changing potential, 
while the other does not. When the two beams are recombined, the total wave 


function has the form 1 
y = -Yo + ~ We", [10.46] 
2 2 
where Wo is the "direct" beam wave function, and I is the extra phase (in part 
dynamic, and in part geometric) acquired by the beam subjected to the varying H. 
In this case 


ONES ir zif 
IVP = zlto? (1+ eT) (1 +e) 
l | 
= 5 ola + cos T) = |Wo|? cos? (T /2). [10.47] 


So by looking for points of constructive and destructive interference (where [F is 
an even or odd multiple of zr, respectively), one can easily measure I". (Berry, and 


!! M. V. Berry. Proc. R. Soc. Lond. A 392. 45 (1984). reprinted in Wilezek and Shapere (foot- 
note 10). It is astonishing. in retrospect. that this result escaped notice for sixty years. 
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FIGURE 10.8: Magnetic flux through a 
surface S bounded by the closed curve C. 


other early writers, worried that the geometric phase might be swamped by a larger 
dynamic phase, but it has proved possible to arrange things so as to separate out 
the two contributions.) 

When the parameter space is three dimensional, R = (R1, R2, R3), Berry’s 
formula (Equation 10.45) is reminiscent of the expression for magnetic flux in 
terms of the vector potential A. The flux, ®, through a surface S bounded by a 
curve C (Figure 10.8), is 


ee | B- da. [10.48] 
S 


If we write the magnetic field in terms of the vector potential (B — V x A), and 
apply Stokes' theorem: 


o= [S xA)»du- d A-dr. [10.49] 
S C 


Thus Berry’s phase can be thought of as the "flux" of a “magnetic field" 
“B” = IVR X (Vn IV R Vn). [10.50] 


through the (closed loop) trajectory in parameter-space. To put it the other way 
around, in the three-dimensional case Berry's phase can be written as a surface 
integral, 


vds J [Ve x (Wal Va¥n)] da. [10.51] 


The magnetic analogy can be carried much further, but for our purposes 
Equation 10.51 is merely a convenient alternative expression for y, (T). 


xProblem 10.3 


(a) Use Equation 10.42 to calculate the geometric phase change when the infinite 
square well expands adiabatically from width w; to width w2. Comment on 
this result. 
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(b) If the expansion occurs at a constant rate (dw/dt = v), what is the dynamic 
phase change for this process? 


(c) If the well now contracts back to its original size, what is Berry’s phase for 
the cycle? 


Problem 10.4 The delta function well (Equation 2.114) supports a single bound 
state (Equation 2.129). Calculate the geometric phase change when œ gradually 
increases from æ; to a». If the increase occurs at a constant rate (da/dt = c), what 
is the dynamic phase change for this process? 


Problem 10.5 Show that if w,(f) is real, the geometric phase vanishes. 
(Problems 10.3 and 10.4 are examples of this.) You might try to beat the rap by 
tacking an unnecessary (but perfectly legal) phase factor onto the eigenfunctions: 
w(t) = elo» yy, (t), where @,(R) is an arbitrary (real) function. Try it. You'll get a 
nonzero geometric phase, all right, but note what happens when you put it back into 
Equation 10.23. And for a closed loop it gives zero. Moral: For nonzero Berry's 
phase, you need (i) more than one time-dependent parameter in the Hamiltonian, 
and (ii) a Hamiltonian that yields nontrivially complex eigenfunctions. 


Example 10.2 The classic example of Berry's phase is an electron at the ori- 
gin, subjected to a magnetic field of constant magnitude but changing direction. 
Consider first the special case (analyzed in Example 10.1) in which B(r) precesses 
around at a constant angular velocity w, making a fixed angle o with the z axis. 
The exact solution (for an electron that starts out with "spin up" along B) is given 
by Equation 10.33. In the adiabatic regime, w «& «1, 


l 2 
à = w] 1-22 cosa + (=) =w] (1- Z cosa) = «| —weosa, [10.52] 
w] 0| (| 


and Equation 10.33 becomes 


X (t) = eg FRIIS A a S CHE Y 
f t ; 
a [2 sino sin 5] ey (n, [10.53] 
w] 2 


As w/w, — 0 the second term drops out completely, and the result matches the 
expected adiabatic form (Equation 10.23). The dynamic phase is 


| [' t 
0. (t) = -; f E(t) dt = EAA [10.54] 
i Jo 2 
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(where Ey = fiw, /2, from Equation 10.29), so the geometric phase is 
ya (f) = (cosa — D. [10.55] 
For a complete cycle T = 27 /w, and therefore Berry’s phase is 
y4 (T) = m(cosa@ — 1). [10.56] 
Now consider the more general case, in which the tip of the magnetic field 


vector sweeps out an arbitrary closed curve on the surface of a sphere of radius 
r = Bo (Figure 10.9). The eigenstate representing spin up along B(f) has the form 


(see Problem 4.30): 
= cos(0/2) (10.57 
ARI el? sin(0/2)  ' ' 


where 0 and @ (the spherical coordinates of B) are now both functions of time. 
Looking up the gradient in spherical coordinates, we find 


OX+ a 18x4 A l Ieg 
or r 06 rsinü 3$ 


2. —(1/2)sin(0/2) Y . 1 0 . 
o". me ny iti r sind (sei Sam] Px [10.58] 


Hence 


ud 
(x41VX4) = 2 |- sin(8/2) cos(6/2) Ó + sin(8/2) cos(8/2) 6 + yee J 


» sin? (6/2) : 


10.5 
rsing l 3] 


FIGURE 10.9: Magnetic field of constant magni- 
tude but changing direction sweeps out a closed 
loop. 


384 Chapter 10 The Adiabatic Approximation 


For Equation 10.51 we need the curl of this quantity: 


1 8|. isin(0/2 \ |, i. 
V Mavic ein pessum 10.60 
X OX) = Lang 86 E sae pl ae ien 


According to Equation 10.51, then, 


l ! 
y+(T)= -3 | 55 [10.61] 
2 r2 


The integral is over the area on the sphere swept out by B in the course of the 
cycle, so da = r? dQ F, and we conclude that 


1 1 
y(T)=—5 [ aa=~39. [10.62] 


where Q is the solid angle subtended at the origin. This is a delightfully simple 
result, and tantalizingly reminiscent of the classical problem with which we began 
the discussion (transport of a frictionless pendulum around a closed path on the 
surface of the earth). It says that if you take a magnet, and lead the electron’s spin 
around adiabatically in an arbitrary closed path, the net (geometric) phase change 
will be minus one-half the solid angle swept out by the magnetic field vector. 
In view of Equation 10.37, this general result is consistent with the special case 
(Equation 10.56), as of course it had to be. 


x x «Problem 10.6 Work out the analog to Equation 10.62 for a particle of spin 1. 
Answer: —Q. (Incidentally, for spin s the result is —s{2.) 


10.2.3 The Aharonov-Bohm Effect 


In classical electrodynamics the potentials (v and A)!* are not directly measur- 
able—the physical quantities are the electric and magnetic fields: 


ð 
E = —Vọ — —. B=VxA. [10.63] 


I21t is customary in quantum mechanics to use the letter V for potential energy. but in electro- 
dynamics the same letter is ordinarily reserved for the scalar potential. To avoid confusion I use ¢ for 
the scalar potential. See Problems 4.59. 4.60, and 4.61 for background to this section. 
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The fundamental laws (Maxwell’s equations and the Lorentz force rule) make no 
reference to potentials, which are (from a logical point of view) no more thar 
convenient but dispensable theoretical constructs. Indeed, you can with impurs» 
change the potentials: 


aA 
Ce pac: A> A’ =A4+VA., [10.64] 


where A is any function of position and time; this is called a gauge transformation, 
and it has no effect on the fields (as you can easily check using Equation 10.63). 

In quantum mechanics the potentials play a more significant role, for the 
Hamiltonian is expressed in terms of g and A, not E and B: 


2 


| fh 2 
H- (Fv ~«a) 4 qu. [10.65] 


2m Vi 


Nevertheless, the theory is still invariant under gauge transformations (see 
Problem 4.61), and for a long time it was taken for granted that there could be 
no electromagnetic influences in regions where E and B are zero—any more than 
there can be in the classical theory. But in 1959 Aharonov and Bohm!? showed that 
the vector potential can affect the quantum behavior of a charged particle, even 
when it is moving through a region in which the field itself is zero. P11 work out a 
simple example first, then discuss the Aharonov-Bohm effect, and finally indicate 
how it all relates to Berry’s phase. 

Imagine a particle constrained to move in a circle of radius b (a bead on a 
wire ring, if you like). Along the axis runs a solenoid of radius a « b, carrying 
a steady electric current / (see Figure 10.10). If the solenoid is extremely long, 
the magnetic field inside it is uniform, and the field outside is zero. But the vector 
potential outside the solenoid is not zero; in fact (adopting the convenient gauge 
condition V- A = 0),!4 


A= E. (r >a), [10.66] 
271 


where © = ;ra?B is the magnetic flux through the solenoid. Meanwhile, the 
solenoid itself is uncharged, so the scalar potential gy is zero. In this case the 
Hamiltonian (Equation 10.65) becomes 


| ; 
H= > |l-^v? + q3 A3 + 2ihqA - v| [10.67] 
Zn 


^Y, Aharonov and D. Bohm, Phys. Rev. 115. 485 (1959). For a significant precursor. see 
W. Ehrenberg and R. E. Siday. Proc. Phys. Soc. London B62. 8 (1949). 


HSee. for instance. D. J. Griffiths, Introduction to Electrodynamics, 3rd ed., Prentice Hall. Upper 
Saddle River. NJ (1999). Equation 5.71. 
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FIGURE 10.10: Charged bead on a circular ring through which a long solenoid 


passes. 


But the wave function depends only on the azimuthal angle ø (8 = 7/2 and r = b), 
so V — (@/b)(d/d@), and the Schrödinger equation reads 


1| ma qoM ħq®d 
mun Maps rom h mds j —IiW(o)=E ; 10.68 
2m | b? d$? T (25) T zb? do y» ¥@) l 
This is a linear differential] equation with constant coefficients: 
Py dy 
— 2iB — = 0. 10.69 
ig? Pas tev [10.69] 
where 5 
o 2mb“ E 
B= ad es DZ gi [10.70] 
zh h- 
Solutions are of the form l 
y — Ae", [10.71] 
with , 
A-BtJBicre =E- IME. [10.72] 


Continuity of V (0). at @ = 27, requires that A be an integer: 


Bt > Vine =n. [10.73] 
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and it follows that 


TENE E ( qo ji (n 20, £1, 2...) [10.74] 
= —— | n — — i n =u Eh E [oh . 
"0 2mb? 2xh | 


The solenoid lifts the two-fold degeneracy of the bead-on-a-ring (Problem 2.46): 
Positive n, representing a particle traveling in the same direction as the current in 
the solenoid, has a somewhat lower energy (assuming q is positive) than negative n, 
describing a particle traveling in the opposite direction. More important, the allowed 
energies clearly depend on the field inside the solenoid, even though the field at the 
location of the particle is zero!? 

More generally, suppose a particle is moving through a region where B is 
zero (so V x A = 0), but A itself is zot. (lll assume that A is static, although 
the method can be generalized to time-dependent potentials.) The (time-dependent) 
Schródinger equation, 


| fh z aw 
e (v -«4) +V |W =ih—. [10.75] 
2m Vi ðt 


with potential energy V—which may or may not include an electrical contribution 
qq —can be simplified by writing 


y — ety, [10.76] 

where r 
g(r) = = Í A(r’) - dr’. [10.77] 

h Jo 


and © is some (arbitrarily chosen) reference point. Note that this definition makes 
sense only when V x A = 0 throughout the region in question—otherwise the line 
integral would depend on the path taken from Ó to r, and hence would not define 
a function of r. In terms of Y’, the gradient of Y is 


VY = el*(iVg)w! + e'* (Vw); 


but Vg — (q/h)A, so 
h lis 
(v — Zy y = ity, [10.78] 
I l 


I5[t is a peculiar property of superconducting rings that the enclosed flux is quantized: ® = 
(2zr h /q)n'. where n’ is an integer. In that case the effect is undetectable, since Ej, = (h? /2m b? )(n-n'y*. 
and (n +n’) is just another integer. (Incidentally. the charge q here turns out to be ^wice the charge of 
an electron; the superconducting clectrons are locked together in pairs.) However. flux quantization is 
enforced by the superconductor (which induces circulating currents to make up the difference), not by 
the solenoid or the electromagnetic field, and it does not occur in the (nonsuperconducting) example 
considered here. 


388 


Chapter10 The Adiabatic Approximation 


and it follows that 


h : ; 
(v 5 ^) y = —mpetv?y', [10.79] 
I 


Putting this into Equation 10.75, and cancelling the common factor é&/*, we are left 
with 
h? aw’ 
——WVwy 4 VW! = ih—. [10.80] 
2m ar 


Evidently Y’ satisfies the Schrödinger equation without A. If we can solve 
Equation 10.80, correcting for the presence of a (curl-free) vector potential will 
be trivial: Just tack on the phase factor e’®. 

Aharonov and Bohm proposed an experiment in which a beam of electrons 
is split in two, and passed either side of a long solenoid, before being recombined 
(Figure 10.11). The beams are kept well away from the solenoid itself, so they 
encounter only regions where B = 0. But A, which is given by Equation 10.66, is 
not zero, and (assuming V is the same on both sides), the two beams arrive with 
different phases: 


q qo 49 
g= [^ dr—-h 5 (: $). (rọ dọ) = to [10.81] 


The plus sign applies to the electrons traveling in the same direction as A—which 
is to say, in the same direction as the current in the solenoid. The beams arrive out 


B 


! 


Beam ; Beam 
split d b recombined 


Solenoid 


FIGURE 10.11: The Aharonov-Bohm effect: The electron beam splits, with half 
passing either side of a long solenoid. 
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tr -R) 


FIGURE 10.12: Particle confined to a box, by a potential V(r — R). 


of phase by an amount proportional to the magnetic flux their paths encircle: 


o 
phase difference — I: [10.82] 


This phase shift leads to measurable interference (Equation 10.47), which has been 
confirmed experimentally by Chambers and others.!Ó 

As Berry pointed out in his first paper on the subject, the Aharonov-Bohm 
effect can be regarded as an example of geometric phase. Suppose the charged 
particle is confined to a box (which is centered at point R outside the solenoid) by 
a potential V(r — R)—see Figure 10.12. (In a moment we're going to transport 
the box around the solenoid, so R will become a function of time, but for now it is 
just some fixed vector.) The eigenfunctions of the Hamiltonian are determined by 


2 
= [Fv = ace | + V(r = R) Vn = En. [10.83] 


We have already learned how to solve equations of this form: Let 
Vn = ey. [10.84] 


where!" 
go Ve 3 
g= 5 J A(r)- dr, [10.85] 
R 


IÓR. G. Chambers. Phys. Rev. Lett. 5, 3 (1960). 


'7It is convenient to set the reference point @ at the center of the box, for this guarantecs that we 
recover the original phase convention when we complete the journey around the solenoid. If you use a 
point in fixed space, for example. you'll have to readjust the phase "by hand." at the far end, because 
the path will have wrapped around the solenoid, circling regions where the curl of A does not vanish. 
This leads to exactly the same answer, but it’s a crude way to do it. In general. when choosing the phase 
convention for the eigenfunctions in Equation 10.9. you want to make sure that yy (x. T) = n(x, 0). 
so that no spurious phase changes are introduced. 
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and v^ satisfies the same eigenvalue equation, only with A — 0: 


2 
E +V(r—- J y, = Enba [10.86] 
Notice that y/ is a function only of the displacement (r — R), not (like y,) of r 
and R separately. 

Now let’s carry the box around the solenoid (in this application the process 
doesn't even have to be adiabatic). To determine Berry’s phase we must first 
evaluate the quantity (y; |Vg Wn). Noting that 


VR Yn = VR |e*v; (r — R)| = -iI ŽAR) ey, (r — R) + e8 Vry; — R). 
we find 


(Wn IVR Wn) 


= | e'8[wi (r — R)*e' E (r — R) + Vry (r — R)| dr 
=i LAR) - J [wi (r — R)]*V yi (r — R) d?r. [10.87] 


The V with no subscript denotes the gradient with respect to r, and I used the 
fact that Vg = —V, when acting on a function of (r — R). But the last integral is 
i /ħ times the expectation value of momentum, in an eigenstate of the Hamiltonian 
—(h?/2m)V? + V, which we know from Section 2.1 is zero. So 


(rs Vg) = -i ŽAR). [10.88] 


Putting this into Berry’s formula (Equation 10.45), we conclude that 


y (T) = 1 PAR- dR= 1 fo eh geet, [10.89] 
h h h 


which neatly confirms the Aharonov-Bohm result (Equation 10.82), and reveals 
that the Aharonov-Bohm effect is a particular instance of geometric phase.!8 
What are we to make of the Aharonov-Bohm effect? Evidently our classi- 
cal preconceptions are simply mistaken: There can be electromagnetic effects in 
regions where the fields are zero. Note however that this does not make A itself 


'8Tncidentally, in this case the analogy between Berry's phase and magnetic flux (Equation 10.50) 
is almost an identity: "B" = (q/h)B. 
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measurable—only the enclosed flux comes into the final answer, and the theory 
remains gauge invariant. 


Problem 10.7 


(a) Derive Equation 10.67 from Equation 10.65. 


(b) 


Derive Equation 10.79, starting with Equation 10.78. 


FURTHER PROBLEMS FOR CHAPTER 10 


x x xProblem 10.8 A particle starts out in the ground state of the infinite square well 
(on the interval 0 € x <a). Now a wall is slowly erected, slightly off-center:!? 


Vix) = f(08 (s = = €) . 


where f(t) rises gradually from 0 to co. According to the adiabatic theorem, the 
particle will remain in the ground state of the evolving Hamiltonian. 


(a) 


(b) 


(d) 


(e) 


Find (and sketch) the ground state at t — oo. Hint: This should be the ground 
state of the infinite square well with an impenetrable barrier at a/2 + e. Note 
that the particle is confined to the (slightly) larger left "half" of the well. 


Find the (transcendental) equation for the ground state of the Hamiltonian at 
time f. Answer: 


zsinz = T [cos z — cos(zó)]. 


where z = ka, T = maf (t)/ħ?, 6 = 2e/a, and k = J/2mE/h. 


Setting 6 = 0, solve graphically for z, and show that the smallest z goes from 
x to 2z as T goes from 0 to oo. Explain this result. 


Now set ô = 0.01 and solve numerically for z, using T = 0, 1, 5, 20. 100, 
and 1000. 


Find the probability P, that the particle is in the right "half" of the well, 
as a function of z and ô. Answer: P, = 1/[1 + (14/J/_)], where J, = 
[1 +6 — (1/z)sin (z(1 + 8))] sin^[(1 x 8)/2]. Evaluate this expression 
numerically for the T’s in part (d). Comment on your results. 


I? Julio Gea-Banacloche, Am. J. Phys. 70. 307 (2002) uses a rectangular barrier; the delta-function 


version was suggested by M. Lakner and J. Peternelj, Am. J. Phys. 71. 519 (2003), 
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(f) Plot the ground state wave function for those same values of T and 5. Note 
how it gets squeezed into the left half of the well, as the barrier grows.” 


x x «Problem 10.9 Suppose the one-dimensional harmonic oscillator (mass m, fre- 
quency o) is subjected to a driving force of the form F(t) = mo? f (t), where f(t) 
is some specified function (I have factored out mw? for notational convenience; 
f (t) has the dimensions of length). The Hamiltonian is 


h^ erue 
A(t) = —— — + -mw x" 


ics 
— mw xf (t). .90 
aad t3 mw x f (t) [10.90] 


Assume that the force was first turned on at time t = 0: f(t) = 0 for t < 0. 
This system can be solved exactly, both in classical mechanics and in quantum 
mechanics.?! 


(a) Determine the classical position of the oscillator, assuming it started from 
rest at the origin (x. (0) = xe (0) = 0). Answer: 


MORE o | F(t’) sinfw(t — tdr. [10.91] 
0 


(b) Show that the solution to the (time-dependent) Schródinger equation for this 
oscillator, assuming it started out in the nth state of the undriven oscillator 
(V (x. 0) = y, (x) where W(x) is given by Equation 2.61), can be written 
as 


L[- eee Dorem X) 292. P? roy oar! 
V (x. f) E À1Ó (re Dhor-emá G PHE fy fa? and] [10.92] 


(c) Show that the eigenfunctions and eigenvalues of H(t) are 


Va(x.f) = Vu(x— fy Ext) = ¢ + ;) hw — jo! f’. [10.93] 


(d) Show that in the adiabatic approximation the classical position (Equa- 
tion 10.91) reduces to x-(t) = f(t). State the precise criterion for adia- 
baticity, in this context, as a constraint on the time derivative of f. Hint: 
Write sin[o(t — t^] as (1/w)(d/dt’)cos[w(t — t’)] and use integration 
by parts. 


20Gea-Banacloche (footnote 19) discusses the evolution of the wave function without assuming 
the adiabatic theorem, and confirms these results in the adiabatic limit. 


2l See Y. Nogami, Am. J. Phys. 59. 64 (1991), and references therein. 
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(e) Confirm the adiabatic theorem for this example, by using the results in (c) 
and (d) to show that 


V (x, f) & yu (x, tell) ef, [10.94] 
Check that the dynamic phase has the correct form (Equation 10.39). Is the 


geometric phase what you would expect? 


Problem 10.10 The adiabatic approximation can be regarded as the first term in an 
adiabatic series for the coefficients cm (t) in Equation 10.12. Suppose the system 
starts out in the mth state; in the adiabatic approximation, it remains in the nth 
state, picking up only a time-dependent geometric phase factor (Equation 10.21): 


Cm(t) = ôn e ^O. 


(a) Substitute this into the right side of Equation 10.16 to obtain the "first cor- 
rection" to adiabaticity: 


Cm(t) = Cm (0) — Í (vn 


7s en) eia oina dt, [10.95] 


This enables us to calculate transition probabilities in the nearly adiabatic 
regime. To develop the “second correction,” we would insert Equation 10.95 
on the right side of Equation 10.16, and so on. 


€ 


As an example, apply Equation 10.95 to the driven oscillator (Problem 10.9). 
Show that (in the near-adiabatic approximation) transitions are possible only 
to the two immediately adjacent levels, for which 


l 
Cn4i(t) = |S va +1 f f (be dt’, 


: l . 5 
TR Lise ra | ft )e7i"" dr. 
V 2h 5 


(The transition probabilities are the absolute squares of these, of course.) 


CHAPTER 11 


SCATTERING 


11.1 INTRODUCTION 
11.1.1 Classical Scattering Theory 


Imagine a particle incident on some scattering center (say, a proton fired at a heavy 
nucleus). It comes in with energy E and impact parameter b, and it emerges at 
some scattering angle 0 —see Figure 11.1. (I'll assume for simplicity that the tar- 
get is azimuthally symmetrical, so the trajectory remains in one plane, and that 
the target is very heavy, so the recoil is negligible.) The essential problem of 
classical scattering theory is this: Given the impact parameter, calculate the scat- 
tering angle. Ordinarily, of course, the smaller the impact parameter, the greater 
the scattering angle. 


Scattering center Z 


FIGURE 11.1: The classical scattering problem, showing the impact parameter b and 
the scattering angle @. 
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FIGURE 11.2: Elastic hard-sphere scattering. 


Example 11.1  Hard-sphere scattering. Suppose the target is a billiard ball, 
of radius R, and the incident particle is a BB, which bounces off elastically 
(Figure 11.2). In terms of the angle a, the impact parameter is b = Rsina, and 
the scattering angle is 6 = x — 2a, so 


xz 0 0 
= Rsin{ >- -=)= —}. 11.1 
b Rsin (3 :) Reos (2) [ ] 
Evidently 
TE 2cos (b/R). it b<R, [11.2] 
0. if b >R. 


More generally, particles incident within an infinitesimal patch of cross- 
sectional area do will scatter into a corresponding infinitesimal solid angle dQ 
(Figure 11.3). The larger do is, the bigger dQ will be; the proportionality factor, 
D(@) = do /dQ, is called the differential (scattering) cross-section:! 


'This is terrible language: D isn't a differential, and it isn't a cross-section. To my ear. the 
words “differential cross-section" would attach more naturally to do. But I'm afraid we're stuck with 
this terminology. I should also warn you that the notation D(@) is nonstandard: Most people just call it 
da /d&) — which makes Equation 11.3 look like a tautology. I think it will be less confusing if we give 
the differential cross-section its own symbol. 
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FIGURE 11.3: Particles incident in the area do scatter into the solid angle d. 


In terms of the impact parameter and the azimuthal angle ¢, do = b db d$ and 


dQ = sin dé d$, so 
b 
D(@) = —— 
e) sin 


db 


al" [11.4] 


(Since 0 is typically a decreasing function of b, the derivative is actually nega- 
tive—hence the absolute value sign.) 


Example 11.2  Hard-sphere scattering (continued). In the case of hard-sphere 


scattering (Example 11.1) 
db l 6 
— =-—-=Rsin| —}. 11. 
dà 7 Rsin ( ;) [11.5] 


a 


SO 


: 2 
D(0) = R cos(0/2) (=) - R 


—. 11. 
sin Ó 2 4 piel 


This example is unusual, in that the differential cross-section is independent of 6. 


The total cross-section is the integral of D(@), over all solid angles: 


a= f Ded: [11.7] 
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roughly speaking, it is the total area of incident beam that is scattered by the target. 
For example, in the case of hard-sphere scattering, 


c = (R?4) | dQ = x R?, [11.8] 


which is just what we would expect: It's the cross-sectional area of the sphere; 
BB's incident within this area will hit the target, and those farther out will miss it 
completely. But the virtue of the formalism developed here is that it applies just as 
well to “soft” targets (such as the Coulomb field of a nucleus) that are not simply 
"hit-or-miss." 

Finally, suppose we have a beam of incident particles, with uniform intensity 
(or luminosity, as particle physicists call it) 


L = number of incident particles per unit area. per unit time. [11.9] 


The number of particles entering area do (and hence scattering into solid angle 
dQ), per unit time, is dN = Ldo = L D(0) dQ, so 
l dN 
D(0) = -—. 11.10 
e) Ld | ] 
This is often taken as the definition of the differential cross-section, because it 
makes reference only to quantities easily measured in the laboratory: If the detector 
accepts particles scattering into a solid angle dQ, we simply count the number 
recorded, per unit time, divide by dQ, and normalize to the luminosity of the 
incident beam. 


x x xProblem 11.1 Rutherford scattering. An incident particle of charge q; and kinetic 
energy E scatters off a heavy stationary particle of charge q2. 


(a) Derive the formula relating the impact parameter to the scattering angle.” 
Answer: b = (qiq2/8m eg E) cot(0/2). 


(b) Determine the differential scattering cross-section. Answer: 


2 
q1q2 2 
D(0) = | —————— |. 11.11 

e) Fearon l l 


(c) Show that the total cross-section for Rutherford scattering is infinite. We say 
that the 1/r potential has “infinite range”; you can’t escape from a Coulomb 
force. 


This isn't easy, and you might want to refer to a book on classical mechanics, such as Jerry 
B. Marion and Stephen T. Thornton, Classical Dynamics of Particles and Systems. 4th ed., Saunders, 
Fort Worth, TX (1995), Section 9.10. 
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e ikz 


FIGURE 11.4: Scattering of waves; incoming plane wave generates outgoing spheri- 
cal wave. 


11.1.2 Quantum Scattering Theory 


In the quantum theory of scattering, we imagine an incident plane wave, y '(z) = 
Ael*., traveling in the z direction, which encounters a scattering potential, produc- 
ing an outgoing spherical wave (Figure 11.4)? That is, we look for solutions to 
the Schródinger equation of the general form 


. ikr 
v(r.0)*A [^ + reo} . for large r. [11.12] 


(The spherical wave carries a factor of 1/r, because this portion of ||? must go 
like 1/r? to conserve probability.) The wave number & is related to the energy of 
the incident particles in the usual way: 


à 


2nE 


k 
h 


Il 


; [11.13] 


As before. I shall assume the target is azimuthally symmetrical; in the more general 
case the amplitude f of the outgoing spherical wave could depend on 6$ as well 
as 0. 


For the moment. there's not much quantum mechanics in this: what we're really talking about 
is the scattering of waves. as opposed to classical particles. and you could even think of Figure 11.4 
as a picture of water waves encountering a rock, or (better, since we're interested in three-dimensional 
scattering) sound waves bouncing off a basketball. In that case we'd write the wave function in the real 
form 
A [cos(kz) + f(8) cos(kr + 8)/r] ; 


and /(8) would represent the amplitude of the scattered sound wave in the direction 8. 
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vat 


FIGURE 11.5: The volume dV of incident beam that passes through area do in 
time di. 


The whole problem is to determine the scattering amplitude /(0); it tells 
you the probability of scattering in a given direction 0, and hence is related to the 
differential cross-section. Indeed, the probability that the incident particle, traveling 
at speed v, passes through the infinitesimal area do, in time df, is (see Figure 11.5) 


P = |Wineident |? dV = [Al (v dt) do. 


But this is equal to the probability that the particle scatters into the corresponding 
solid angle dQ: 


2 - 
P = |Wecattereal? dV = P — 35 — (v dt)r ? dQ. 


from which it follows that do = |f|? dQ, and hence 


ud 
DE = = For. [11.14] 


Evidently the differential cross-section (which is the quantity of interest to the 
experimentalist) is equal to the absolute square of the scattering amplitude (which 
is obtained by solving the Schródinger equation). In the following sections we 
will study two techniques for calculating the scattering amplitude: partial wave 
analysis and the Born approximation. 


Problem 11.2 Construct the analogs to Equation 11.12 for one-dimensional and 
two-dimensional scattering. 


11.2 PARTIAL WAVE ANALYSIS 


11.2.1 Formalism 


As we found in Chapter 4, the Schródinger equation for a spherically symmetrical 
potential V (r) admits the separable solutions 


y (r.6. 6) = R()Y; (8. 9). [11.15] 
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where Y,” is a spherical harmonic (Equation 4.32), and u(r) = r R(r) satisfies the 
radial equation (Equation 4.37): 


2 42 2 
nd : E u= ze [11.16] 


-— s +| VEe) + — 
r? 0) 2m r? 
At very large r the potential goes to zero, and the centrifugal contribution is neg- 
ligible, so 


The general solution is l 

u(r) 2 Ce" + De"; 
the first term represents an outgoing spherical wave, and the second an incoming 
one—for the scattered wave we evidently want D = 0. At very large r, then, 


el kr 


R(r) — 


as we already deduced (on physical grounds) in the previous section 
(Equation 11.12). 

That's for very large r (more precisely, for kr >> 1; in optics it would be 
called the radiation zone). As in one-dimensional scattering theory, we assume 
that the potential is "localized," in the sense that exterior to some finite scattering 
region it is essentially zero (Figure 11.6). In the intermediate region (where V can 
be ignored but the centrifugal term cannot),^ the radial equation becomes 


d?u LL 4 1) 
— u 


/ |ntermediate region 
V=0 


Radiation zone 


(kr }) 1) 


FIGURE 11.6: Scattering from a localized potential: the scattering region (darker 
shading), the intermediate region (lighter shading), and the radiation zone (where 
kr > 1). 


*What follows does not apply to the Coulomb potential. since 1/r goes to zero more slow! 
2 pply p £ y 
than 1/r^. as r — oo. and the centrifugal term does not dominate in this region. In this sense the 
Coulomb potential is not localized, and partial wave analysis ts inapplicable. 
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TABLE 11.1: Spherical Hankel functions, bt) (x) and pe (x). 


for x > | 


and the general solution (Equation 4.45) is a linear combination of spherical Bessel 
functions: 
u(r) = Arji(kr) + Brnj(kr). [11.18] 


However, neither j; (which is somewhat like a sine function) nor n; (which is a 
sort of generalized cosine function) represents an outgoing (or an incoming) wave. 
What we need are the linear combinations analogous to eif" and e^*^; these are 
known as spherical Hankel functions: 


AY (x) = j(x)-inj(x) h(x) = f(x) — inj (x). [11.19] 


The first few spherical Hankel functions are listed in Table 11.1. At large r, h” (kr) 
(the “Hankel function of the first kind") goes like et Jr. whereas h (kr) (the 


“Hankel function of the second kind") goes like LU /r; for outgoing waves, then, 
we need spherical Hankel functions of the first kind: 


R(r) ~ hf? (kr). [11.20] 


Thus the exact wave function, outside the scattering region (where V(r) — 
0), is 


V(.8.0) 2 A Ve" + Y Cus hy (er) Y" (0.0) E. [11.21] 


lm 


The first term is the incident plane wave, and the sum (with expansion coefficients 
Ci.m) represents the scattered wave. But since we are assuming the potential is 
spherically symmetric, the wave function cannot depend on @.° So only terms with 


There's nothing wrong with 0 dependence, of course, because the incoming plane wave defines 
a z direction, breaking the spherical symmetry. But the azimuthal symmetry remains; the incident plane 
wave has no @ dependence. and there is nothing in the scattering process that could introduce any $ 
dependence in the outgoing wave. 
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m = 0 survive (remember, Y!" ~ &/"9?). Now (from Equations 4.27 and 4.32) 
l 


YPO. $) = | E Rcoso. [11.22] 


where P} is the /th Legendre polynomial. It is customary to redefine the expansion 
coefficients, letting Cj, = ite /Ax QI Da: 


^o 
e^ Ek YO it + Nah [11.23] 


1-0 


V(r.8)— «| kr) noun]. 


You'll see in a moment why this peculiar notation is convenient; c; is called the 
lth partial wave amplitude. 

Now, for very large r the Hankel function goes like (—i)/t!e/A"/kr (Table 
11.1), so 


] kr 
V(r.0) «A [^ + HOR | : [11.24] 
7 
where " 
f(6) = YQI + 1) a Pi(cos?). [11.25] 


I=0 


This confirms more rigorously the general structure postulated in Equation 11.12, 
and tells us how to compute the scattering amplitude, f(@), in terms of the partial 
wave amplitudes (aj). The differential cross-section is 


D(8) = |F = 5 ^5 QI + DQI' + Daft ay Pi(cos@) Py(cos@). — [11.26] 
ior 
and the total cross-section is 
oO 
c —4n X Ql +1) la)’. [11.27] 
1-0 
(I used the orthogonality of the Legendre polynomials, Equation 4.34, to do the 
angular integration.) 


11.2.2 Strategy 


All that remains is to determine the partial wave amplitudes, a;, for the potential in 
question. This is accomplished by solving the Schródinger equation in the interior 
region (where V (r) is distinctly non-zero), and matching this to the exterior solution 
(Equation 11.23), using the appropriate boundary conditions. The only problem is 
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that as it stands my notation is hybrid: I used spherical coordinates for the scattered 
wave, but cartesian coordinates for the incident wave. We need to rewrite the wave 
function in a more consistent notation. 

Of course, e^: satisfies the Schrödinger equation with V = 0. On the other 
hand, I just argued that the general solution to the Schrödinger equation with 
V — 0 can be written in the form 


2 [Aim Ji (kr) + Bim ni(kr)] Y? (8. 9). 
lam 


In particular, then, it must be possible to express e/* in this way. But e/^* is finite 
at the origin, so no Neumann functions are allowed (n;(kr) blows up at r = 0), 
and since z = r cosÓ has no @ dependence, only m = 0 terms occur. The explicit 
expansion of a plane wave in terms of spherical waves is known as Rayleigh's 
formula: 


oo 
e^: = i! (21 + Dj kr) Pi (cos). [11.28] 
i=0 


Using this, the wave function in the exterior region can be expressed entirely in 
terms of r and 6: 


oo me 
Yr 2 AM i QI 4 1) [j (kr) + ik aj nid) P, (cos 0). [11.29] 
1-0 


Example 11.3 Quantum hard-sphere scattering. Suppose 


" < 
Voy] eraa [11.30] 
0, forr >a. 


The boundary condition, then, is 
V (a, 8) =0. [11.31] 


SO 
oo 
wives [ico +ika hi (ka) | P,(cos@) =0 [11.32] 
1—0 

for all 0, from which it follows (Problem 11.3) that 


i(k 
aj =i n, [11.33] 
kh, (ka) 


For a guide to the proof, see George Arfken and Hans-Jurgen Weber, Mathematical Methods 
for Physicists, 5th ed., Academic Press, Orlando (2000). Exercises 12.4.7 and 12.4.8. 
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In particular, the total cross-section is 


Eat 


That's the exact answer, but it's not terribly illuminating, so let's consider the 
limiting case of low-energy scattering: ka « 1. (Since k = 27/4, this amounts to 
saying that the wavelength is much greater than the radius of the sphere.) Referring 
to Table 4.4, we note that n;(z) is much larger than j;(z), for small z, so 


Ji (ka) 
h(k ka) 


[11.34] 


ie) JG) Con JG) 
hP) 5 — AG + ini(z) ni(z) 
Aded d ZET an [11.35] 
lanl +1 Leap] ^ C l 


and hence 


2] 41-42 
Lye EA ntt 


But we're assuming ka « 1, so the higher powers are negligible—in the low 
energy approximation the scattering is dominated by the / = 0 term. (This means 
that the differential cross-section is independent of 6, just as it was in the classical 
case.) Evidently 

o 7 4ra’, [11.36] 


for low energy hard-sphere scattering. Surprisingly, the scattering cross-section is 
four times the geometrical cross-section—in fact, ø is the total surface area of the 
sphere. This “larger effective size” is characteristic of long-wavelength scattering 
(it would be true in optics, as well); in a sense, these waves “feel” their way around 
the whole sphere, whereas classical particles only see the head-on cross-section. 


Problem 11.3 Prove Equation 11.33, starting with Equation 11.32. Hint: Exploit 
the orthogonality of the Legendre polynomials to show that the coefficients with 
different values of / must separately vanish. 


x «Problem 11.4 Consider the case of low-energy scattering from a spherical delta- 


function shell: 
V(r) — aó(r — a). 


where œ and a are constants. Calculate the scattering amplitude, f(@), the differ- 
ential cross-section, D(0), and the total cross-section, o. Assume ka < 1, so that 
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only the / = 0 term contributes significantly. (To simplify matters, throw out all 
l Æ 0 terms right from the start.) The main problem, of course, is to determine 
ag. Express your answer in terms of the dimensionless quantity 8 = 2maa/h?. 
Answer: o = Axa? ?/(14- BY. | 


11.3 PHASE SHIFTS 


Consider first the problem of one-dimensional scattering from a localized potential 
V (x) on the half-line x < O (Figure 11.7). I'll put a “brick wall” at x = 0, so a 
wave incident from the left, 


Wi(x) = Ae" (x < —a) [11.37] 


is entirely reflected 

W(x) = Be" (x < —a). [11.38] 
Whatever happens in the interaction region (—a < x « 0), the amplitude of the 
reflected wave has got to be the same as that of the incident wave, by conservation 
of probability. But it need not have the same phase. If there were no potential at 


all (just the wall at x = 0), then B = —A, since the total wave function (incident 
plus reflected) must vanish at the origin: 


Vo(x) = A (e = gt) (V(x) =0). [11.39] 
If the potential is not zero, the wave function (for x « —a) takes the form 


Vx) — A (e = grat) (V(x) #0). [11.40] 


FIGURE 11.7: One-dimensional scattering from a localized potential bounded on the 
right by an infinite wall. 
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The whole theory of scattering reduces to the problem of calculating the phase 
shift’ 5 (as a function of k, and hence of the energy E = A?k? /2m), for a spec- 
ified potential. We do this, of course, by solving the Schrédinger equation in the 
scattering region (—a < x « Q), and imposing appropriate boundary conditions 
(see Problem 11.5). The virtue of working with the phase shift (as opposed to the 
complex amplitude B) is that it illuminates the physics (because of conservation 
of probability, all the potential can do is shift the phase of the reflected wave) and 
simplifies the mathematics (trading a complex quantity —two real numbers— for a 
single real quantity). 

Now let's return to the three-dimensional case. The incident plane wave 
(Ael*:) carries no angular momentum in the z direction (Rayleigh's formula con- 
tains no terms with m Æ 0), but it includes all values of the tota! angular momen- 
tum (/ = 0, 1, 2, ...). Because angular momentum is conserved (by a spherically 
symmetric potential), each partial wave (labelled by a particular /) scatters inde- 
pendently, with (again) no change in amplitude? —only in phase. If there is no 
potential at all, then wo = Ae!*<, and the /th partial wave is (Equation 11.28) 


yP = ai! (21 + 1) ji(kr) Pi(cos8) (V(r) = 0). [11.41] 


But (from Equation 11.19 and Table 11.1) 


l l j ; 
c EE ENS (oe ee 2o uix 4, ; E six 
h(x) = 5 L (x) +h, c) » [ iy*'e"* ite | (x >> 1). [11.42] 


So for large r 
(21 + 1) 
2ikr 
The second term in square brackets represents an incoming spherical wave; it is 
unchanged when we introduce the scattering potential. The first term is the outgoing 

wave; it picks up a phase shift ô: 
21 + 1 
al : ) 
2ikr 


YP xA Gi - cnet | Pi(cos@) (V(r) = 0). [11.43] 


y D a [ett - 1ye] P(cos@) (V(r) Z0). [11.44] 


Think of it as a converging spherical wave (due exclusively to the ni?) component 
in e/^*), which is phase shifted 23, (see footnote 7) and emerges as an outgoing 
spherical wave (the h? ! part of e'™*? as well as the scattered wave itself). 


The 2 in front of 6 is conventional. We think of the incident wave as being phase shifted once 
on the way in, and again on the way out; by 6 we mean the “one way" phase shift, and the rotal is 
therefore 26. 

One reason this subject can be so confusing is that practically everything is called an "ampli- 
tude:" f(0) is the “scattering amplitude," a) is the “partial wave amplitude." but the first is a function 
of 0, and both are complex numbers. I'm row talking about "amplitude" in the original sense: the (real, 
of course) height of a sinusoidal wave. 
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In Section 11.2.1 the whole theory was expressed in terms of the partial wave 
amplitudes aj; now we have formulated it in terms of the phase shifts ô. There 
must be a connection between the two. Indeed, comparing the asymptotic (large 7) 
form of Equation 11.23 


2l 4-1) [ x, ir 21 4-1 Tm 
YOA C [ei - (=e + ud ek! pcos) [11.45] 
2ikr r 
with the generic expression in terms of 5; (Equation 11.44), we find? 
l ; l; 
ET. (e = 1) = xe" sin(a). [11.46] 


It follows in particular (Equation 11.25) that 


La ; 
f06)- Q1 + De sin(5;) P)(cos 4) [11.47] 
1=0 
and (Equation 11.27) 


oo 


o= i XO (21 + 1) sin? (8;). [11.48] 
1-0 

Again, the advantage of working with phase shifts (as opposed to partial wave 

amplitudes) is that they are easier to interpret physically, and simpler mathemat- 

ically—the phase shift formalism exploits conservation of angular momentum to 

reduce a complex quantity a; (two real numbers) to a single real one 4). 


Problem 11.5 A particle of mass m and energy E is incident from the left on the 
potential 


0. (x « —a). 
V(x)—-1-V». (-a<x <0). 
oc. (x > 0). 


(a) If the incoming wave is Ae/^* (where k = /2mE /h), find the reflected wave. 
Answer: 


z — ik’ cot(k’ Tm 
Ag Dif rem eI, where k' = /2m(E + Vo)/h. 


k + tk’ cot(k’a) 


9 Although l used the asymptotic form of the wave function to draw the connection between 
aj and ój. there is nothing approximate about the result (Equation 11.46). Both of them are constants 
(independent of r). and 6; means the phase shift in the asymptotic region (where the Hankel functions 
have settled down to e* "^^ Jkr). 
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(b) Confirm that the reflected wave has the same amplitude as the incident wave. 


(c) Find the phase shift 5 (Equation 11.40) for a very deep well (E « Vo). 
Answer: ó — —ka. 


Problem 11.6 What are the partial wave phase shifts (6j) for hard-sphere scattering 
(Example 11.3)? 


Problem 11.7 Find the S-wave (/ = 0) partial wave phase shift 59(k) for scattering 
from a delta-function shell (Problem 11.4). Assume that the radial wave function 
u(r) goes to 0 as r — oo. Answer: 


11.4 THE BORN APPROXIMATION 


ka 2maa 
-1 
— cot cot(ka) 4- m . where B= 
| B sin? (ka) d h? 
11.4.1 Integral Form of the Schrödinger Equation 
The time-independent Schrödinger equation, 
We cs 
——V^y + Vy = Ey, [11.49] 
2m 
can be written more succinctly as 
(V? 4 ky = Q. [11.50] 
where 
J/2m E 2 
k= = ' and Q= zzv. [11.51] 
1 1^ 


This has the superficial form of the Helmholtz equation; note, however, that the 
“inhomogeneous” term (Q) itself depends on y. 
Suppose we could find a function G(r) that solves the Helmholtz equation 
with a delta function "source: 
(V? +k)G(r) = Sr). [11.52] 


Then we could express y as an integral: 


y) = J G(r — ro) Q (ro) ^ro. [11.53] 
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For it is easy to show that this satisfies Schródinger's equation, in the form of 
Equation 11.50: 


(V? + ky (x) = J |? i66 - ro] Q (ro) d?ro 


= | Sæ- reo d*r9 = Q (r). 


G(r) is called the Green’s function for the Helmholtz equation. (In general, the 
Green’s function for a linear differential equation represents the “response” to a 
delta-function source.) 

Our first task!° is to solve Equation 11.52 for G(r). This is most easily accom- 
plished by taking the Fourier transform, which turns the differential equation into 
an algebraic equation. Let 


l : 
G(r) = sam | ete) d*s. [11.54] 
Then 
(v? + k?)G(r) = Gaye | Iv? die yer | g(s) ds. 
But u | 
Voss [11.55] 
and (see Equation 2.144) 
"pe J eT d's [11.56] 
— 27} i : 


so Equation 11.52 says 


: i l - 
amc) + k?)elS* o (s) d?s = On} fer d's. 


It follows!! that i 


rP shy 
Putting this back into Equation 11.54, we find: 


= 1 isr l 3 
G= P E Gras tS: [11 


g(s) [11.57] 


10Warning: You are approaching two pages of heavy analysis, including contour integ 
you wish, skip straight to the answer, Equation 11.65. 

I l'This is clearly sufficient, but it is also necessary, as you can show by combining the t 
into a single integral, and using Plancherel's theorem. Equation 2.102. 
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l FIGURE 11.8: Convenient coordinates for the 
`J integral in Equation 11.58. 


Now, r is fixed, as far as the s integration is concerned, so we may as well choose 
spherical coordinates (s, 0, $) with the polar axis along r (Figure 11.8). Thens-r = 
sr cos6, the @ integral is trivial (277), and the @ integral is 


rT isrcosé@ |7 2 si 
| gered unggp eer | LN [11.59] 
0 isr |o sT 
Thus 
1 2 [??ssin(sr) 1 °° ssin(sr) 
= — ————-ds = ds. 11.60 
epo (27)? r i K s2 S 7 agh [. pega [11.60] 


The remaining integral is not so simple. It pays to revert to exponential nota- 
tion, and factor the denominator: 


i e9 seis" TP se 
OO) = Si [5 ees oe een] 
= gh- h). "ose 


These two integrals can be evaluated using Cauchy's integral formula: 


f (2) 


(z — zo) 


z = 2xif (zo). [11.62] 


if zo lies within the contour (otherwise the integral is zero). In the present case 
the integration is along the real axis, and it passes right over the pole singularities 
at +k. We have to decide how to skirt the poles—I’ll go over the one at —k 
and under the one at +k (Figure 11.9). (You're welcome to choose some other 
convention if you like—even winding seven times around each pole—you’ll get 
a different Green's function, but, as I'll show you in a minute, they're all equally 
acceptable.) 

For each integral in Equation 11.61 we must "close the contour" in such a 
way that the semicircle at infinity contributes nothing. In the case of 74, the factor 


Section 11.4: The Born Approximation 411 


Im(s) 


FIGURE 11.9: Skirting the poles in the 
s--k s=+k Pels) contour integral (Equation 11.61). 


ei goes to zero when s has a large positive imaginary part; for this one we close 
above (Figure 11.10(a)). The contour encloses only the singularity at s = +k, so 


sel?" 1 sel?" 
h = ds = 2ri 
eed ew ZIP 


In the case of J, the factor e^!" goes to zero when s has a large negative imagi- 
nary part, so we close below (Figure 11.10(b)); this time the contour encloses the 
singularity at s = —k (and it goes around in the clockwise direction, so we pick 
up a minus sign): 


—isr —isr 
h=-$¢ id IET d 
s—kis-ck s—k 


Conclusion: 


= ix e^", [11.63] 


s=k 


= —ine". [11.64] 


sy——k 


G(r) = a | (ixe’*”) = (ize) | — Hun [11.65] 


This, finally, is the Green's function for the Helmholtz equation — the solution 
to Equation 11.52. (If you got lost in all that analysis, you might want to check 
the result by direct differentiation— see Problem 11.8.) Or rather, it is a Green's 
function for the Helmholtz equation, for we can add to G(r) any function Go(r) 
that satisfies the homogeneous Helmholtz equation: 


(V? + k?)Go(r) = 0; [11.66] 


(a) (b) 
FIGURE 11.10: Closing the contour in Equations 11.63 and 11.64. 
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clearly, the result (G + Go) still satisfies Equation 11.52. This ambiguity corre- 
sponds precisely to the ambiguity in how to skirt the poles—a different choice 
amounts to picking a different function Go(r). 

Returning to Equation 11.53, the general solution to the Schródinger equation 
takes the form 


m egi*Ir-rol 
— | — —— V (row (ro) d?ro, 11.67 
zm tl (ro) V (ro) d ro [11.67] 


y(r) = Vo(r) — 


where Wyo satisfies the free-particle Schrödinger equation, 
(V? 4 k?)yg = 0. [11.68] 


Equation 11.67 is the integral form of the Schródinger equation; it is entirely 
equivalent to the more familiar differential form. At first glance it looks like an 
explicit solution to the Schródinger equation (for any potential)— which is too 
good to be true. Don't be deceived: There's a y under the integral sign on the 
right hand side, so you can't do the integral unless you already know the solution! 
Nevertheless, the integral form can be very powerful, and it is particularly well 
suited to scattering problems, as we'll see in the following section. 


Problem 11.8 Check that Equation 11.65 satisfies Equation 11.52, by direct sub- 
stitution. Hint: V7(1/r) = —4z8?(r)."? 


x *Problem 11.9 Show that the ground state of hydrogen (Equation 4.80) satisfies 


the integral form of the Schródinger equation, for the appropriate V and E (note 
that E is negative, so k = ik, where k = . —2mE Jh). 


11.4.2 The First Born Approximation 


Suppose V (ro) is localized about rg = O (that is, the potential drops to zero outside 
some finite region, as is typical for a scattering problem), and we want to calculate 
V (r) at points far away from the scattering center. Then |r| > |ro| for all points 
that contribute to the integral in Equation 11.67, so 


r-r 
Ir- ro? = 7? +r -2r ro Sr? (1-2 ;) [11.69] 
p^ 


and hence 
Ir — rg| =r—F-ro. [11.70] 


I*See, for example, D. Griffiths, Introduction to Electrodynamics, 3rd ed. (Prentice Hall. Upper 
Saddle River, NJ. 1999). Section 1.5.3. 
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Let 
k = kf; [11.71] 
then 
eiklr—rol ~ gn giro [11.72] 
and therefore irc] i 
qr-ri a 


(In the denominator we can afford to make the more radical approximation 
lr — ro] = r; in the exponent we need to keep the next term. If this puzzles 
you, try writing out the next term in the expansion of the denominator. What we 
are doing is expanding in powers of the small quantity (9/1), and dropping all but 
the lowest order.) 

In the case of scattering, we want 


Vo(r) = Ae’, [11.74] 
representing an incident plane wave. For large r, then, 


m ee -ik-r 3 

5 — fe ? V (ro) y (ro) d ro. [11.75] 
2nh* r 
This is in the standard form (Equation 11.12), and we can read off the scattering 
amplitude: 


y (r) S Ae'^ — 


m 
2n h?A 
So far, this is exact. Now we invoke the Born approximation: Suppose the 


incoming plane wave is not substantially altered by the potential; then it makes 
sense to use 


f(8.0) = — J e To y (ro) y (ro) d? ro. [11.76] 


V (ro) © Vo(ro) = Aeta = Agro, [11.77] 


where 
k = kz, [11.78] 


inside the integral. (This would be the exact wave function, if V were zero; it is 
essentially a weak potential approximation.!?) In the Born approximation, then, 


fO p) = - | aS d°ro. [11.79] 


27h 


13 Generally, partial wave analysis is useful when the incident particle has low energy, for then 
only the first few terms in the series contribute significantly; the Born approximation applies when the 
potential is weak, compared to the incident energy. so the deflection is small. 
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FIGURE 11.11: Two wave vectors in the Born 
"- approximation: k’ points in the incident direction, k 
k'-kz in the scattered direction. 


(In case you have lost track of the definitions of k' and k, they both have magnitude 
k, but the former points in the direction of the incident beam, while the latter points 
toward the detector—see Figure 11.11; #(k — k’) is the momentum transfer in 
the process.) 

In particular, for low energy (long wavelength) scattering, the exponential 
factor is essentially constant over the scattering region, and the Born approximation 
simplifies to 


f(0.9) zx — ie 5 J V(r) d*r, (low energy). [11.80] 


20h 


(I dropped the subscript on r, since there is no likelihood of confusion at this point.) 


Example 11.4 Low-energy soft-sphere scattering.'* Suppose 


Vo. ifr <a, 
v=} 0 t5 [11.81] 
0. ifr>a. 
In this case the low-energy scattering amplitude is 
m 4 
0, p) S — V (2x4) ; 11.82 
f8. 6 TrA E [11.82] 
(independent of 0 and ¢), the differential cross-section is 
2 
do 5 2m Voa? V^ 
== = = | —— |] . 11.83 
and the total cross-section is 
2 
2m Voa? N^ 
c X 4m ea [11.84] 
3h* 


I You can't apply the Born approximation to hard-sphere scattering (Vy = oo)—t1he integral 
blows up. The point is that we assumed the potential is weak, and doesn't change the wave function 
much in the scattering region. But a hard sphere changes it. radically —from Ae!^* to zero. 
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For a spherically symmetrical potential, V (r) = V (r) — but not necessarily 
at low energy —the Born approximation again reduces to a simpler form. Define 


k =k’ —k, [11.85] 
and let the polar axis for the ro integral lie along x, so that 
(k’ — k) - ro = «rocos 6o. [11.86] 
Then 


f(0)z E el*rocos&o y (33 sin 8o dro do doo. [11.87] 
Th 


The o integral is trivial (277), and the @ integral is one we have encountered 
before (see Equation 11.59). Dropping the subscript on r, we are left with 


2 | oo 
f(0)z -— | rV(r)sin(«r)dr. (spherical symmetry). [11.88] 
i^k Jo 


The angular dependence of f is carried by x; in Figure 11.11 we see that 


k = 2k sin(0/2). [11.89] 


Example 11.5 Yukawa scattering. The Yukawa potential (which is a crude 
model for the binding force in an atomic nucleus) has the form 


e Hr 
V(r) 28——. [11.90] 
p 
where B and u are constants. The Born approximation gives 
2mp 
A? (u? +?) 


oo 
fF) z om | e^" sin(kr)dr = [11.91] 
Se h 


h 


(You get to work out the integral for yourself, in Problem 11.11.) 


Example 11.6 Rutherford scattering. If we put in B = q)q2/47eo, u = O, 
the Yukawa potential reduces to the Coulomb potential, describing the electrical 
interaction of two point charges. Evidently the scattering amplitude is 


2mqiqa 


; 11.92 
4x coh? k? | l 


f@)= 
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or (using Equations 11.89 and 11.51): 


Oe aes [11.93] 
The differential cross-section is the square of this: 

do qiq2 : 

707 ml ; [11.94] 


which is precisely the Rutherford formula (Equation 11.11). It happens that for 
the Coulomb potential, classical mechanics, the Born approximation, and quantum 
field theory all yield the same result. As they say in the computer business, the 
Rutherford formula is amazingly “robust.” 


xProblem 11.10 Find the scattering amplitude, in the Born approximation, for soft- 
sphere scattering at arbitrary energy. Show that your formula reduces to 
Equation 11.82 in the low-energy limit. 


Problem 11.11 Evaluate the integral in Equation 11.91, to confirm the expression 
on the right. 


xxProblem 11.12 Calculate the total cross-section for scattering from a Yukawa 
potential, in the Born approximation. Express your answer as a function of E. 


*Problem 11.13 For the potential in Problem 11.4, 
(a) calculate f(@), D(@), and o, in the low-energy Born approximation; 
(b) calculate f(@) for arbitrary energies, in the Born approximation; 


(c) show that your results are consistent with the answer to Problem 11.4, in the 
appropriate regime. 


11.4.3 The Born Series 


The Born approximation is similar in spirit to the impulse approximation in 
classical scattering theory. In the impulse approximation we begin by pretending 
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Actual 
F, trajectory 


Scattering center 


FIGURE 11.12: The impulse approximation assumes that the particle continues unde- 
flected, and calculates the transverse momentum delivered.. 


that the particle keeps. going in a straight line (Figure 11.12), and compute the 
transverse impulse that would be delivered to it in that case: 


r= f raa [11.95] 


If the deflection is relatively small, this should be a good approximation to the 
transverse momentum imparted to the particle, and hence the scattering angle is 


0 S tan"! (I/p), [11.96] 


where p is the incident momentum. This is, if you like, the “first-order” impulse 
approximation (the zeroth-order is what we started with: no deflection at all). 
Likewise, in the zeroth-order Born approximation the incident plane wave passes 
by with no modification, and what we explored in the previous section is really the 
first-order correction to this. But the same idea can be iterated to generate a series 
of higher-order corrections, which presumably converge to the exact answer. 

The integral form of the Schródinger equation reads 


V (r) = vo(r) + J g(r — ro) V (ro) v (ro) d?ro. [11.97] 


where Wo is the incident wave, 


en) =>. — [11.98] 


is the Green's function (into which I have now incorporated the factor 2in/h*, for 
convenience), and V is the scattering potential. Schematically, 


V = Wo + f ovv. [11.99] 


Suppose we take this expression for y, and plug it in under the integral sign: 


y= yo+ | evvo+ |f evevy. [11.100] 
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FIGURE 11.13: Diagrammatic interpretation of the Born series (Equation 11.101). 


Iterating this procedure, we obtain a formal series for w: 


v=w+favwo+| fevevwo+ | [ evevevvo+-. [11.101] 


In each integrand only the incident wavefunction (Wo) appears, together with more 
and more powers of g V. The first Born approximation truncates the series after the 
second term, but it is pretty clear how one generates the higher-order corrections. 

The Born series can be represented diagrammatically as shown in Figure 11.13. 
In zeroth order y is untouched by the potential; in first order it is “kicked” once, 
and then “propagates” out in some new direction; in second order it is kicked, prop- 
agates to a new location, is kicked again, and then propagates out; and so on. In this 
context the Green's function is sometimes called the propagator —it tells you how 
the disturbance propagates between one interaction and the next. The Born series 
was the inspiration for Feynman's formulation of relativistic quantum mechanics, 
which is expressed entirely in terms of vertex factors (V) and propagators (g), 
connected together in Feynman diagrams. 


Problem 11.14 Calculate 6 (as a function of the impact parameter) for Rutherford 
scattering, in the impulse approximation. Show that your result is consistent with 
the exact expression (Problem 11.1(a)), in the appropriate limit. 


x x xProblem 11.15 Find the scattering amplitude for low-energy soft-sphere scattering 
in the second Born approximation. Answer: —(2m Voa? /3h7)f1 — (4m Vga? /5h?]. 


FURTHER PROBLEMS FOR CHAPTER 11 


x x xProblem 11.16 Find the Green's function for the one-dimensional Schrödinger 
equation, and use it to construct the integral form (analogous to Equation 11.67). 
Answer: 


; oo 
V (x) = pax) - —- J gi y (xo) y (x9) dxo. [11.102] 
h*k —oo 
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x xProblem 11.17 Use your result in Problem 11.16 to develop the Born approxima- 
tion for one-dimensional scattering (on the interval —oo < x « oo, with no “brick 
wall" at the origin). That is, choose yo(x) = Aei** | and assume V (xo) = Wo(xo) 
to evaluate the integral. Show that the reflection coefficient takes the form: 


m M | (99 
R={— e” y (x) dx 
(s) TN m 


Problem 11.18 Use the one-dimensional Born approximation (Problem 11.17) to 
compute the transmission coefficient (T — 1 — R) for scattering from a delta 
function (Equation 2.114) and from a finite square well (Equation 2.145). Compare 
your results with the exact answers (Equations 2.141 and 2.169). 


2 


[11.103] 


Problem 11.19 Prove the optical theorem, which relates the total cross-section 
to the imaginary part of the forward scattering amplitude: 


o = A Im Oy. [11.104] 


Hint: Use Equations 11.47 and 11.48. 


Problem 11.20 Use the Born approximation to determine the total cross-section 
for scattering from a gaussian potential 


V(r) = Ae", 


Express your answer in terms of the constants A, u, and m (the mass of the incident 
particle), and k = /2mE/h, where E is the incident energy. 


CHAPTER 12 


AFTERWORD 


Now that you have (I hope) a sound understanding of what quantum mechanics 
says, I would like to return to the question of what it means — continuing the story 
begun in Section 1.2. The source of the problem is the indeterminacy associated 
with the statistical interpretation of the wave function. For V (or, more generally, 
the quantum state —it could be a spinor, for example) does not uniquely deter- 
mine the outcome of a measurement; all it provides is the statistical distribution of 
possible results. This raises a profound question: Did the physical system "actually 
have" the attribute in question prior to the measurement (the so-called realist view- 
point), or did the act of measurement itself "create" the property, limited only by 
the statistical constraint imposed by the wave function (the orthodox position) —or 
can we duck the question entirely, on the grounds that it is “metaphysical” (the 
agnostic response)? 

According to the realist, quantum mechanics is an incomplete theory, for even 
if you know everything quantum mechanics has to tell vou about the system (to wit: 
its wave function), still you cannot determine all of its features. Evidently there is 
some other information, external to quantum mechanics, which (together with Y) 
is required for a complete description of physical reality. 

The orthodox position raises even more disturbing problems, for if the act of 
measurement forces the system to "take a stand," helping to create an attribute 
that was not there previously,! then there is something very peculiar about the 


I This may be strange. but it is not mystical, as some popularizers would like to suggest. 
The so-called wave-particle duality. which Niels Bohr elevated to the status of a cosmic principle 
(complementarity). makes electrons sound like unpredictable adolescents. who sometimes behave like 
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FIGURE 12.1: Bohm's version of the 
ae D zs EPR experiment: A x° at rest decays into 
a 9 S> saan electron-positron pair. 


measurement process. Moreover, in order to account for the fact that an immedi- 
ately repeated measurement yields the same result, we are forced to assume that 
the act of measurement collapses the wave function, in a manner that is diffi- 
cult, at best, to reconcile with the normal evolution prescribed by the Schrödinger 
equation. 

In light of this, it is no wonder that generations of physicists retreated to the 
agnostic position, and advised their students not to waste time worrying about the 
conceptual foundations of the theory. 


12.1 THE EPR PARADOX 


In 1935, Einstein, Podolsky, and Rosen? published the famous EPR paradox, 
which was designed to prove (on purely theoretical grounds) that the realist position 
is the only sustainable one. I'll describe a simplified version of the EPR paradox, 
introduced by David Bohm. Consider the decay of the neutral pi meson into an 
electron and a positron: 


n? e^ et, 


Assuming the pion was at rest, the electron and positron fly off in opposite direc- 
tions (Figure 12.1). Now, the pion has spin zero, so conservation of angular momen- 
tum requires that the electron and positron are in the singlet configuration: 


] 
l-t- bt): [12.1] 


If the electron is found to have spin up, the positron must have spin down, and 
vice versa. Quantum mechanics can't tell you which combination you'll get, in 
any particular pion decay, but it does say that the measurements will be corre- 
lated, and you'll get each combination half the time (on average). Now suppose 


adults, and sometimes, for no particular reason. like children. I prefer to avoid such language. When I say 
that a particle does not have a particular attribute before its measurement, I have in mind. lor example. 


: I : T 
an electron in the spin state x = o J^ à measurement of the x-component of its angular momentum 


could return the value 7/2. or (with equal probability) the value —/1/2, but until the measurement is 
made it simply does not have a well-defined value of Sy. 


ŽA. Einstein, B. Podolsky. and N. Rosen. Phys. Rev. 47. 777 (1935). 
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we let the electron and positron fly way off—10 meters, in a practical experi- 
ment, or, in principle, 10 light years—and then you measure the spin of the 
electron. Say you get spin up. Immediately you know that someone 20 meters 
(or 20 light years) away will get spin down, if that person examines the 
positron. 

To the realist, there’s nothing surprising in this—the electron really had 
spin up (and the positron spin down) from the moment they were created ... 
it’s just that quantum mechanics didn’t know about it. But the “orthodox” view 
holds that neither particle had either spin up or spin down until the act of 
measurement intervened: Your measurement of the electron collapsed the wave 
function, and instantaneously "produced" the spin of the positron 20 meters (or 
20 light years) away. Einstein, Podolsky, and Rosen considered such “spooky 
action-at-a-distance" (Einstein’s words) preposterous. They concluded that 
the orthodox position is untenable; the electron and positron must have had 
well-defined spins all along, whether quantum mechanics can calculate them 
or not. 

The fundamental assumption on which the EPR argument rests is that no 
influence can propagate faster than the speed of light. We call this the princi- 
ple of locality. You might be tempted to propose that the collapse of the wave 
function is not instantaneous, but "travels" at some finite velocity. However, this 
would lead to violations of angular momentum conservation, for if we measured 
the spin of the positron before the news of the collapse had reached it, there 
would be a fifty-fifty probability of finding both particles with spin up. Whatever 
one might think of such a theory in the abstract, the experiments are unequiv- 
ocal: No such violation occurs—the (anti-)correlation of the spins is perfect. 
Evidently the collapse of the wave function—whatever its ontological status—is 
instantaneous. 


Problem 12.1 Entangled states. The singlet spin configuration (Equation 12.1) 
is the classic example of an entangled state —a two-particle state that cannot be 
expressed as the product of two one-particle states, and for which, therefore, one 
cannot really speak of “the state” of either particle separately. You might wonder 
whether this is somehow an artifact of bad notation—maybe some linear combi- 
nation of the one-particle states would disentangle the system. Prove the following 
theorem: 


Consider a two-level system, |ġ4) and |¢p), with (¢;|¢;) = 4;;. (For example, 
|da} might represent spin up and |ġp} spin down.) The two-particle state 


a&|Ga(1)) lon (2)) + Llor) ba (2)) 
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(with œ Æ 0 and B Æ 0) cannot be expressed as a product 
IV. Qv; Q2). 


for any one-particle states |y) and |W). 


Hint: Write |y.) and |W,) as linear combinations of |@,) and |b). 


12.2 BELL’S THEOREM 


Einstein, Podolsky, and Rosen did not doubt that quantum mechanics is correct, 
as far as it goes; they only claimed that it is an incomplete description of physical 
reality: The wave function is not the whole story—some other quantity, A, is 
needed, in addition to W, to characterize the state of a system fully. We call A 
the “hidden variable” because, at this stage, we have no idea how to calculate 
or measure it.? Over the years, a number of hidden variable theories have been 
proposed, to supplement quantum mechanics;* they tend to be cumbersome and 
implausible, but never mind—until 1964 the program seemed eminently worth 
pursuing. But in that year J. S. Bell proved that any local hidden variable theory 
is incompatible with quantum mechanics.’ 

Bell suggested a generalization of the EPR/Bohm experiment: Instead of ori- 
enting the electron and positron detectors along the same direction, he allowed 
them to be rotated independently. The first measures the component of the electron 
spin in the direction of a unit vector a, and the second measures the spin of the 
positron along the direction b (Figure 12.2). For simplicity, let's record the spins 
in units of /1/2; then each detector registers the value +1 (for spin up) or —1 (spin 
down), along the direction in question. A table of results, for many 2° decays, 
might look like this: 


electron positron product 
= -l 
+1 +1 
+1 -1 


—] -] 
-1 +1 


3The hidden variable could be a single number, or it could be a whole collection of numbers: 
perhaps A is to be calculated in some future theory. or maybe it is for some reason of principle 
incalculable. Tt hardly matters, All 1 am asserting is that there must be something —if only a list ol the 
outcomes of every possible experiment—associated with the system prior to a measurement, 

4D, Bohm. Phys. Rev. 85. 166. 180 (1952). 


?Bell's original paper (Physics 1. 195 (1964)) is a gem: brief, accessible. and beautifully written. 
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FIGURE 12.2: Bell's version of the EPR-Bohm experiment: detectors independently 
oriented in directions a and b. 


Bell proposed to calculate the average value of the product of the spins, for 

a given set of detector orientations. Call this average P(a, b). If the detectors are 

parallel (b = a), we recover the original EPRB configuration; in this case one is 

spin up and the other spin down, so the product is always —1, and hence so too is 
the average: 

P(a,a) = -1. [12.2] 


By the same token, if they are anti-parallel (b = —a), then every product is +1, so 
P(a, —a) = +1. [12.3] 


For arbitrary orientations, quantum mechanics predicts 


(see Problem 4.50). What Bell discovered is that this result is incompatible with 
any local hidden variable theory. 

The argument is stunningly simple. Suppose that the “complete” state of the 
electron/positron system is characterized by the hidden variable(s) A (A varies, in 
some way that we neither understand nor control, from one pion decay to the next). 
Suppose further that the outcome of the electron measurement is independent of 
the orientation (b) of the positron detector—which may, after all, be chosen by the 
experimenter at the positron end just before the electron measurement is made, and 
hence far too late for any subluminal message to get back to the electron detector. 
(This is the locality assumption.) Then there exists some function A(a. A) which 
gives the result of an electron measurement, and some other function B(b, A) for 
the positron measurement. These functions can only take on the values + 1:6 


A(a,A)= t1; B(b.A)= +1. [12.5] 


6This already concedes far more than a classical determinist would be prepared to allow, for it 
abandons any notion that the particles could have well-defined angular momentum vectors with simul- 
taneously determinate components. But never mind—the point of Bell's argument is to demonstrate 
that quantum mechanics is incompatible with any local deterministic theory — even one that bends over 
backwards to be accommodating. 
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When the detectors are aligned, the results are perfectly (anti)-correlated: 
A(a, A) = —B(a. À). [12.6] 


for all À. 
Now, the average of the product of the measurements is 


P (a. b = f pA. DBO. A) dA. [12.7] 


where p(A) is the probability density for the hidden variable. (Like any probability 
density, it is nonnegative, and satisfies the normalization condition f o(A) dA = 1, 
but beyond this we make no assumptions about (A); different hidden variable 
theories would presumably deliver quite different expressions for p.) In view of 
Equation 12.6, we can eliminate B: 


Pa. b) =- | 2004. DAD, D dd. [12.8] 
If c is any other unit vector, 
Pta, b) — P(a.e) =- | o [Aa DAD.) - Ata, WAED] [129] 
Or, since [A(b, A)]* = I: 
Pta.b) - Pa.c) =~ f p[i- AG. WAC. A)]A(a, NACH, Ada. [12.10] 


But it follows from Equation 12.5 that —1 < [A(a.A)A(b.A)] < +1; moreover 
pà) — A(b, A)A(e. A)] = 0, so 


Pa. b) — Pa. 9) x | o0- Atb.2)ACe. 29] d. [12.11] 


|P(a. b) — P(a.c)| < 1+ P(b.c). [12.12] 


This is the famous Bell inequality. It holds for any local hidden variable theory 
(subject only to the minimal requirements of Equations 12.5 and 12.6), for we have 
made no assumptions whatever as to the nature or number of the hidden variable(s), 
or their distribution (p). 

But it is easy to show that the quantum mechanical prediction (Equation 12.4) 
is incompatible with Bell’s inequality. For example, suppose all three vectors lie in 


or, more simply: 
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FIGURE 12.3: An orientation of the detectors that demon- 
a strates quantum violations of Bell's inequality. 


a plane, and c makes a 45* angle with a and b (Figure 12.3); in this case quantum 
mechanics says 


P(a.b) 20. P(a,c) = P(b.c) = —0.707, 
which is patently inconsistent with Bell's inequality: 
0.707 € 1 — 0.707 = 0.293. 


With Bell's modification, then, the EPR paradox proves something far more 
radical than its authors imagined: If they are right, then not only is quantum mechan- 
ics incomplete, it is downright wrong. On the other hand, if quantum mechanics 
is right, then no hidden variable theory is going to rescue us from the nonlocality 
Einstein considered so preposterous. Moreover, we are provided with a very simple 
experiment to settle the issue once and for all. 

Many experiments to test Bell’s inequality were performed in the '60's and 
'70's, culminating in the work of Aspect, Grangier, and Roger. The details do not 
concern us here (they actually used two-photon atomic transitions, not pion decays). 
To exclude the remote possibility that the positron detector might somehow "sense" 
the orientation of the electron detector, both orientations were set quasi-randomly 
after the photons were already in flight. The results were in excellent agreement 
with the predictions of quantum mechanics, and clearly incompatible with Bell's 
inequality.® 

Ironically, the experimental confirmation of quantum mechanics came as some- 
thing of a shock to the scientific community. But not because it spelled the demise 
of "realism" — most physicists had long since adjusted to this (and for those who 
could not, there remained the possibility of nonlocal hidden variable theories, 


7A, Aspect. P. Grangier. and G. Roger. Phys. Rev. Lett. 49, 91 (1982). For more recent experi- 
ments sce G. Weihs et al.. Phys. Rev. Lett. 81. 5039 (1998). 


SBell's theorem involves averages and it is conceivable that an apparatus such as Aspects 
contains some secret bias which selects out a nonrepresentative sample, thus distorting the average. In 
1989. an improved version of Bells theorem was proposed, in which a single measurement suffices 
to distinguish between the quantum prediction and that of any local hidden variable theory. See D. 
Greenberger. M. Horne. A. Shimony. and A. Zeilinger. Am. J. Phys. 58, 1131 (1990) and N. David 
Mernmin. Am. J. Phys. 58. 731 (1990). 
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Projector Bug 


Screen 


FIGURE 12.4: The shadow of the bug moves across the screen at a velocity v’ greater 
than c, provided the screen is far enough away. 


to which Bell's theorem does not apply?). The real shock was the demonstra- 
tion that nature itself is fundamentally nonlocal. Nonlocality, in the form of the 
instantaneous collapse of the wave function (and for that matter also in the sym- 
metrization requirement for identical particles) had always been a feature of the 
orthodox interpretation, but before Aspect's experiment it was possible to hope 
that quantum nonlocality was somehow a nonphysical artifact of the formalism, 
with no detectable consequences. That hope can no longer be sustained, and we 
are obliged to reexamine our objection to instantaneous action-at-a-distance. 

Why are physicists so squeamish about superluminal influences? After all, 
there are many things that travel faster than light. If a bug flies across the beam 
of a movie projector, the speed of its shadow is proportional to the distance to 
the screen; in principle, that distance can be as large as. you like, and hence the 
shadow can move at arbitrarily high velocity (Figure 12.4). However, the shadow 
does not carry any energy; nor can it transmit a message from one point to another 
on the screen. A person at point X cannot cause anything to happen at point Y by 
manipulating the passing shadow. 

On the other hand, a causal influence that propagated faster than light would 
carry unacceptable implications. For according to special relativity there exist iner- 
tial frames in which such a signal propagates backward in time —the effect pre- 
ceding the cause— and this leads to inescapable logical anomalies. (You could, for 
example, arrange to kill your infant grandfather. Not a good idea!) The question is, 
are the superluminal influences predicted by quantum mechanics and detected by 


?It is a curious twist of fate that the EPR paradox. which assumed locality in order to prove 
realism, led finally to the demise of locality and left the issue of realism undecided— the outcome (as 
Mermin put it) Einstein would have liked least. Most physicists today consider that if they can't have 
local realism, there's not much point in realism at all, and for this reason nonlocal hidden variable 
theories occupy a rather peripheral place. Still. some authors— notably Bell himself, in Speakable and 
Unspeakable in Quantum Mechanics (Cambridge University Press. 1987)—argue that such theories 
offer the best hope of bridging the conceptual gap between the measured system and the measuring 
apparatus, and for supplying an intelligible mechanism for the collapse of the wave function. 
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Aspect causal, in this sense, or are they somehow ethereal enough (like the motion 
of the shadow) to escape the philosophical objection? 

Well, let's consider Bell's experiment. Does the measurement of the electron 
influence the outcome of the positron measurement? Assuredly it does — otherwise 
we cannot account for the correlation of the data. But does the measurement of the 
electron cause a particular outcome for the positron? Not in any ordinary sense 
of the word. There is no way the person manning the electron detector could use 
his measurement to send a signal to the person at the positron detector, since he 
does not control the outcome of his own measurement (he cannot make a given 
electron come out spin up, any more than the person at X can affect the passing 
shadow of the bug). It is true that he can decide whether to make a measurement 
at all, but the positron monitor, having immediate access only to data at his end of 
the line, cannot tell whether the electron was measured or not, for the lists of data 
compiled at the two ends, considered separately, are completely random. It is only 
when we compare the two lists later that we discover the remarkable correlations. 
In another reference frame the positron measurements occur before the electron 
measurements, and yet this leads to no logical paradox —the observed correlation 
is entirely symmetrical in its treatment, and it is a matter of indifference whether we 
say the observation of the electron influenced the measurement of the positron, or 
the other way around. This is a wonderfully delicate kind of influence whose only 
manifestation is a subtle correlation between two lists of otherwise random data. 

We are led, then, to distinguish two types of influence: the "causal" variety, 
which produce actual changes in some physical property of the receiver, detectable 
by measurements on that subsystem alone, and an “ethereal” kind, which do not 
transmit energy or information, and for which the only evidence is a correlation 
in the data taken on the two separate subsystems—a correlation which by its 
nature cannot be detected by examining either list alone. Causal influences cannot 
propagate faster than light, but there is no compelling reason why ethereal ones 
should not. The influences associated with the collapse of the wave function are of 
the latter type, and the fact that they "travel" faster than light may be surprising, 
but it is not, after all, catastrophic.!? 


12.3 THE NO-CLONE THEOREM 


Quantum measurements are typically destructive, in the sense that they alter the 
state of the system measured. This is how the uncertainty principle is enforced in 
the laboratory. You might wonder why we don't just make a bunch of identical 
copies (clones) of the original state, and measure them, leaving the system itself 


IÜAn enormous amount has been written about Bell's theorem. My favorite is an inspired essay 
by David Mermin in Physics Today (April 1985. page 38). An extensive bibliography will be found in 
L. E. Ballentine, Am. J. Phys. 55. 785 (1987). 
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unscathed. It can’t be done. Indeed, if you could build a cloning device (a “quantum 
Xerox machine"), quantum mechanics would be out the window. 

For example, it would then be possible to send superluminal messages using 
the EPRB experiment. Say the message to be transmitted, from the operator of the 
positron detector to the operator of the electron detector, is either "yes" or "no." 
If the message is "yes," the sender measures S. (of the positron). Never mind 
what result she gets —all that matters is that she makes the measurement, for this 
means that the electron is now in the definite state ¢ or | (never mind which). The 
receiver immediately makes a million clones of the electron, and measures $, on 
each of them. If they all yield the same answer (never mind which answer), we 
can be pretty sure that the electron was in fact measured, so the message is "yes." 
If half of them are spin up, and half spin down, then the electron was definitely 
not measured, and the message is "no." 

But you can't make a quantum Xerox machine, as Wootters, Zurek, and Dieks 
proved in 1982.!! Schematically, we want the machine to take as input a particle 
in state |y} (the one to be copied), plus a second particle in state | X) (the “blank 
sheet of paper"), and spit out two particles in the state |y} (original plus copy): 


IV)IX) > Iv)lv). [12.13] 


Suppose we have made a device that successfully clones the state |Y}: 


IIX) > Ivillvi). [12.14] 


and also works for state |yW2): 


I2) X) — Iva)lva) [12.15] 


(ly) and |y?) might be spin up and spin down, for example, if the particle is an 
electron). So far, so good. But what happens when we feed in a linear combination 
|v) = eda) + Bla)? Evidently we get? 


IV) X) — edyi)lvn) + B1yz)lva). [12.16] 


which is not at all what we wanted— what we wanted was 


Iu)1X) — [WY I) = lel) + BI»): vai) 4-812)] 
= a7 |i) Wi) +67 leo) Wo) 4-e BEA a2) + H2) |a]. — 12:17] 


IW. K. Wootters and W. H. Zurek. Nature 299. 802 (1982); D. Dieks, Phys. Lett. A 92, 271 
(1982). 

'2This assumes that the device acts linearly on the state |y). as it must. since the time-dependent 
Schrédinger equation (which presumably governs the process) is linear. 
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You can make a machine to clone spin-up electrons and spin-down electrons, but 
it’s. going to fail for any nontrivial linear combinations. It’s as though you bought 
a Xerox machine that copies vertical lines perfectly, and also horizontal lines, but 
completely distorts diagonals. 


12.4 SCHRODINGER’S CAT 


The measurement process plays a mischievous role in quantum mechanics: It is 
here that indeterminacy, nonlocality, the collapse of the wave function, and all 
the attendant conceptual difficulties arise. Absent measurement, the wave func- 
tion evolves in a leisurely and deterministic way, according to the Schródinger 
equation, and quantum mechanics looks like a rather ordinary field theory (much 
simpler than classical electrodynamics, for example, since there is only one field 
(V), instead of two (E and B), and it's a scalar). It is the bizarre role of the 
measurement process that gives quantum mechanics its extraordinary richness 
and subtlety. But what, exactly, is a measurement? What makes it so different 
from other physical processes?!? And how can we tell when a measurement has 
occurred? 

Schródinger posed the essential question most starkly, in his famous cat 
paradox: ^ 


A cat is placed in a steel chamber, together with the following hellish 
contraption.... In a Geiger counter there is a tiny amount of radioactive 
substance, so tiny that maybe within an hour one of the atoms decays, but 
equally probably none of them decays. If one decays then the counter triggers 
and via a relay activates a little hammer which breaks a container of cyanide. 
If one has left this entire system for an hour, then one would say the cat is 
living if no atom has decayed. The first decay would have poisoned it. The 
wave function of the entire system would express this by containing equal 
parts of the living and dead cat. 


At the end of the hour, the wave function of the cat has the schematic form 
1 


Jg U tive + V'dead)- [12.18] 


y = 


'3There is a school of thought that rejects this distinction, holding that the system and the 
measuring apparatus should be described by one great big wave function which itself evolves according 
to the Schrédinger equation. In such theories there is no collapse of the wave function, but one must 
typically abandon any hope of describing individual events —quantum mechanics (in this view) applies 
only to ensembles of identically prepared systems. See, for example, Philip Pearle Am. J. Phys. 35. 742 
(1967). or Leslie E. Ballentine, Quantum Mechanics: A Modern Development, 2nd ed., World Scientific, 
Singapore (1998). 


Hg, Schrödinger, Naturwiss. 48, 52 (1935); translation by Josef M. Jauch, Foundations of Quan- 
tuin Mechanics. Addison-Wesley, Reading (1968), p. 185. 
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The cat is neither alive nor dead, but rather a linear combination of the two, until a 
measurement occurs—until, say, you peek in the window to check. At that moment 
your observation forces the cat to “take a stand”: dead or alive. And if you find 
him to be dead, then it’s really you who killed him, by looking in the window. 

Schrédinger regarded this as patent nonsense, and I think most physicists 
would agree with him. There is something absurd about the very idea of a macro- 
scopic object being in a linear combination of two palpably different states. An 
electron can be in a linear combination of spin up and spin down, but a cat simply 
cannot be in a linear combination of alive and dead. How are we to reconcile this 
with the orthodox interpretation of quantum mechanics? 

The most widely accepted answer is that the triggering of the Geiger counter 
constitutes the “measurement,” in the sense of the statistical interpretation, not 
the intervention of a human observer. It is the essence of a measurement that 
some macroscopic system is affected (the Geiger counter, in this instance). The 
measurement occurs at the moment when the microscopic system (described by 
the laws of quantum mechanics) interacts with the macroscopic system (described 
by the laws of classical mechanics) in such a way as to leave a permanent record. 
The macroscopic system itself is not permitted to occupy a linear combination of 
distinct states. !5 

I would not pretend that this is an entirely satisfactory resolution, but at least 
it avoids the stultifying solipsism of Wigner and others, who persuaded themselves 
that it is the involvement of human consciousness that constitutes a measurement 
in quantum mechanics. Part of the problem is the word “measurement” itself, 
which certainly carries a suggestion of human participation. Heisenberg proposed 
the word “event,” which might be preferable. But I’m afraid “measurement” is so 
ingrained by now that we’re stuck with it. And, in the end, no manipulation of the 
terminology can completely exorcise this mysterious ghost. 


12.5 THE QUANTUM ZENO PARADOX 


The collapse of the wave function is undoubtedly the most peculiar feature of this 
whole bizarre story. [t was introduced on purely theoretical grounds, to account 
for the fact that an immediately repeated measurement reproduces the same value. 
But surely such a radical postulate must carry directly observable consequences. In 


I3Of course, in some ultimate sense the macroscopic system is itself described by the laws of 
quantum mechanics. But wave functions, in the first instance. describe individual elementary particles: 
the wave function of a macroscopic object would be a monstrously complicated composite. built out 
of all the wave functions of its 10?? constituent particles. Presumably somewhere in the statistics of 
large numbers macroscopic linear combinations become extremely improbable. Indeed, if you were 
able somehow to get a damped pendulum (say) into a linear combination of macroscopically distinct 
quantum states, it would. in a tiny fraction of the damping time, revert to an ordinary classical state. 
This phenomenon is called decoherence. See. for example. R. Omnes. The Interpretation of Quantum 
Mechanics (Princeton, 1994), Chapter 7. 
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it’s going to fail for any nontrivial linear combinations. It’s as though you bought 
a Xerox machine that copies vertical lines perfectly, and also horizontal lines, but 
completely distorts diagonals. 
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The measurement process plays a mischievous role in quantum mechanics: It is 
here that indeterminacy, nonlocality, the collapse of the wave function, and all 
the attendant conceptual difficulties arise. Absent measurement, the wave func- 
tion evolves in a leisurely and deterministic way, according to the Schródinger 
equation, and quantum mechanics looks like a rather ordinary field theory (much 
simpler than classical electrodynamics, for example, since there is only one field 
(V), instead of two (E and B), and it’s a scalar). It is the bizarre role of the 
measurement process that gives quantum mechanics its extraordinary richness 
and subtlety. But what, exactly, is a measurement? What makes it so different 
from other physical processes?!? And how can we tell when a measurement has 
occurred? 

Schródinger posed the essential question most starkly, in his famous cat 
paradox: !4 


A cat is placed in a steel chamber, together with the following hellish 
contraption.... In a Geiger counter there is a tiny amount of radioactive 
substance, so tiny that maybe within an hour one of the atoms decays, but 
equally probably none of them decays. If one decays then the counter triggers 
and via a relay activates a little hammer which breaks a container of cyanide. 
If one has left this entire system for an hour, then one would say the cat is 
living if no atom has decayed. The first decay would have poisoned it. The 
wave function of the entire system would express this by containing equal 
parts of the living and dead cat. 


At the end of the hour, the wave function of the cat has the schematic form 


] 
= Y jvc ea . 12.18 
V J; Ut + Waead) [ ] 


l3There is a school of thought that rejects this distinction, holding that the system and the 
measuring apparatus should be described by one great big wave function which itself evolves according 
to the Schródinger equation. In such theories there is no collapse of the wave function, but one must 
typically abandon any hope of describing individual events— quantum mechanics (in this view) applies 
only to ensembles of identically prepared systems. See, for example, Philip Pearle Am. J. Phys. 35, 742 
(1967). or Leslie E. Ballentine, Quantum Mechanics: À Modern Development, 2nd ed., World Scientific, 
Singapore (1998). 


Hg. Schródinger, Naturwiss. 48, 52 (1935); translation by Josef M. Jauch. Foundations of Quan- 
tum Mechanics, Addison-Wesley. Reading (1968), p. 185. 
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measurement occurs—until, say, you peek in the window to check. At that moment 
your observation forces the cat to “take a stand”: dead or alive. And if you find 
him to be dead, then it’s really you who killed him, by looking in the window. 

Schródinger regarded this as patent nonsense, and I think most physicists 
would agree with him. There is something absurd about the very idea of a macro- 
scopic object being in a linear combination of two palpably different states. An 
electron can be in a linear combination of spin up and spin down, but a cat simply 
cannot be in a linear combination of alive and dead. How are we to reconcile this 
with the orthodox interpretation of quantum mechanics? 

The most widely accepted answer is that the triggering of the Geiger counter 
constitutes the "measurement," in the sense of the statistical interpretation, not 
the intervention of a human observer. It is the essence of a measurement that 
some macróscopic system is affected (the Geiger counter, in this instance). The 
measurement occurs at the moment when the microscopic system (described by 
the laws of quantum mechanics) interacts with the macroscopic system (described 
by the laws of classical mechanics) in such a way as to leave a permanent record. 
The macroscopic system itself is not permitted to occupy a linear combination of 
distinct states. 

I would not pretend that this is an entirely satisfactory resolution, but at least 
it avoids the stultifying solipsism of Wigner and others, who persuaded themselves 
that it is the involvement of human consciousness that constitutes a measurement 
in quantum mechanics. Part of the problem is the word "measurement" itself, 
which certainly carries a suggestion of human participation. Heisenberg proposed 
the word “event,” which might be preferable. But I'm afraid “measurement” is so 
ingrained by now that we're stuck with it. And, in the end, no manipulation of the 
terminology can completely exorcise this mysterious ghost. 


12.5 THE QUANTUM ZENO PARADOX 


The collapse of the wave function is undoubtedly the most peculiar feature of this 
whole bizarre story. It was introduced on purely theoretical grounds, to account 
for the fact that an immediately repeated measurement reproduces the same value. 
But surely such a radical postulate must carry directly observable consequences. In 


I3Of course. in some ultimate sense the macroscopic system is itself described by the laws of 
quantum mechanics. But wave functions, in the first instance, describe individual elementary particles; 
the wave function of a macroscopic object would be a monstrously complicated composite. built out 
of all the wave functions of its 10°? constituent particles. Presumably somewhere in the statistics of 
large numbers macroscopic linear combinations become extremely improbable. Indeed, if you were 
able somehow to get a damped pendulum (say) into a linear combination of macroscopically distinct 
quantum states, it would, in a tiny fraction of the damping time, revert to an ordinary classical state. 
This phenomenon is called decoherence. See, for example. R. Omnes. The Interpretation of Quantum 
Mechanics (Princeton, 1994). Chapter 7. 
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1977 Misra and Sudarshan!Ó proposed what they called the quantum Zeno effect 
as a dramatic experimental demonstration of the collapse of the wave function. 
Their idea was to take an unstable system (an atom in an excited state, say), and 
subject it to repeated measurements. Each observation collapses the wave function, 
resetting the clock, and it is possible by this means to delay indefinitely the expected 
transition to the lower state.!" 

Specifically, suppose a system starts out in the excited state W2, which has 
a natural lifetime r for transition to the ground state V. Ordinarily, for times 
substantially less than t, the probability of a transition is proportional to f (see 
Equation 9.42); in fact, since the transition rate is 1/r, 


t 
Pop = —. [12.19] 
T 


If we make a measurement after a time f, then, the probability that the system is 
still in the upper state is 


Pst) 21— L, [12.20] 


Suppose we do find it to be in the upper state. In that case the wave function 
collapses back to y2, and the process starts all over again. If we make a second 
measurement, at 27, the probability that the system is still in the upper state is 


evidently 
2 


(1 E 3j pe. [12.21] 


T T 


which is the same as it would have been had we never made the first measurement 
at r. This is what one would naively expect; if it were the whole story there would 
be nothing gained by repeatedly observing the system, and there would be no 
quantum Zeno effect. 

However, for extremely short times, the probability of a transition is not pro- 
portional to t, but rather to t? (see Equation 9.39):!8 


Po.) =ar. [12.22] 


In this case the probability that the system is still in the upper state after the two 
measurements is 


( -ary & 1 — 2et?, [12.23] 


!6B. Misra and E. C. G. Sudarshan, J. Math. Phys. 18, 756 (1977). 


U"rhis effect doesn't have much to do with Zeno. but it is reminiscent of the old adage. "a 
watched pot never boils,” so it is sometimes called the watched pot phenomenon. 

I5In the argument leading to linear time dependence, we assumed that the function sin*(Qr/2)/ 22 
in Equation 9.39 was a sharp spike. However. the width of the "spike" is of order Aw = 42/1, and for 
extremely short t this approximation fails, and the integral becomes (12/4) | plw)dw. 
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whereas if we had never made the first measurement it would have been 
1 —a(2r)? = 1 — 4ar. [12.24] 


Evidently our observation of the system after time t decreased the net probability 
of a transition to the lower state! 

Indeed, if we examine the system at n regular intervals, from t = O out 
to t = T (that is, we make measurements at T/n, 2T/n, 3T/n.....T), the 
probability that the system is still in the upper state at the end is 


(1 —a(T/n)*)" xl- Zr, [12.25] 
H 


which goes to 1 in the limit n — oo: A continuously observed unstable system 
never decays at all! Some authors regard this as an absurd conclusion, and a proof 
that the collapse of the wave function is fallacious. However, their argument hinges 
on a rather loose interpretation of what constitutes “observation.” If the track of a 
particle in a bubble chamber amounts to "continuous observation," then the case is 
closed, for such particles certainly do decay (in fact, their lifetime is not measurably 
extended by the presence of the detector). But such a particle is only intermittently 
interacting with the atoms in the chamber, and for the quantum Zeno effect to 
occur the successive measurements must be made extremely rapidly. in order to 
catch the system in the 7? regime. 

As it turns out, the experiment is impractical for spontaneous transitions, but 
it can be done using induced transitions, and the results are in excellent agreement 
with the theoretical predictions.!? Unfortunately, this experiment is not as com- 
pelling a confirmation of the collapse of the wave function as its designers hoped; 
the observed effect can be accounted for in other ways.?0 


LLELLLLI 


In this book I have tried to tell a consistent and coherent story: The wave 
function (V) represents the state of a particle (or system); particles do not in gen- 
eral possess specific dynamical properties (position, momentum, energy, angular 
momentum, etc.) until an act of measurement intervenes; the probability of getting 
a particular value in any given experiment is determined by the statistical inter- 
pretation of V; upon measurement the wave function collapses, so that an imme- 
diately repeated measurement is certain to yield the same result. There are other 
possible interpretations — nonlocal hidden variable theories, the "many worlds" pic- 
ture, “consistent histories," ensemble models, and others— but I believe this one 


(Ow. M. Itano, D. J. Heinzen. J. J. Bollinger. and D. J. Wineland, Phys. Rev. A 41. 2295 (1990). 


L, E. Ballentine. Found. Phys. 20, 1329 (1990): T. Petrosky. S. Tasaki. and I. Prigogine. Phys. 
Lett. A 151. 109 (1990). 
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is conceptually the simplest, and certainly it is the one shared by most physicists 
today.?! It has stood the test of time, and emerged unscathed from every exper- 
imental challenge. But I cannot believe this is the end of the story; at the very 
least, we have much to learn about the nature of measurement and the mechanism 
of collapse. And it is entirely possible that future generations will look back, from 
the vantage point of a more sophisticated theory, and wonder how we could have 
been so gullible. 


*!'See Daniel Styer et al.. Am. J. Phys. 70. 288 (2002). 


APPENDIX 


LINEAR ALGEBRA 


Linear algebra abstracts and generalizes the arithmetic of ordinary vectors, such as 
those we encounter in first-year physics. The generalization is in two directions: 
(1) We allow the scalars to be complex numbers, and (2) we do not restrict ourselves 
to three dimensions. 


A.1 VECTORS 


A vector space consists of a set of vectors (|a), |8), |y). ...), together with a set 
of scalars (a, b, c, ...),! which is closed? under two operations: vector addition 
and scalar multiplication. 

e Vector Addition 


The "sum" of any two vectors is another vector: 


la) + |B) = ly). [A.1] 
Vector addition is commutative: 
la) + |B) = |B) + la). [A.2] 


lFor our purposes. the scalars will be ordinary complex numbers. Mathematicians can tell you 
about vector spaces over more exotic fields. but such objects play no role in quantum mechanics. Note 
that a. B. y ... are not (ordinarily) numbers: they are names (labels) — "Charlie." for instance, or 
“P43A-9GL,” or whatever you care to use to identify the vector in question. 


"That is to say, these operations are always well-defined, and will never carry you outside the 
vector space. 
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and associative: 


la) + (8) + Iy) = (lo) 4-18) + ly). [A.3] 


There exists a zero (or null) vector,? |0}, with the property that 


lo) + |0) = la). [A.4] 


for every vector |o). And for every vector |o) there is an associated inverse 
vector (| — «)),* such that 


la) + | — a) = 10). [A.5] 


Scalar Multiplication 


The “product” of any scalar with any vector is another vector: 


ala) = |y). [A.6] 


Scalar multiplication is distributive with respect to vector addition: 


a(|a) + 18)) = ala) + al), [A.7] 


and with respect to scalar addition: 


(a+ b)|a) = ala) + bia). [A.8] 


It is also associative with respect to the ordinary multiplication of scalars: 


a(b|a)) = (ab)|a). [A.9] 


Multiplication by the scalars 0 and 1 has the effect you would expect: 
Ola) = |0): 1lo) = la). [A.10] 
Evidently | — o) = (—1)|æ) (which we write more simply as — |o)). 


There's a lot less here than meets the eye—all I have done is to write down 


in abstract language the familiar rules for manipulating vectors. The virtue of such 
abstraction is that we will be able to apply our knowledge and intuition about the 
behavior of ordinary vectors to other systems that happen to share the same formal 
properties. 


"It is customary. where no confusion can arise, to write the null vector without the adorning 


bracket: |0) — 0. 


*'This is funny notation. since œ is not a number. I’m simply adopting the name "—Charlie" for 


the inverse of the vector whose name is "Charlie." More natural terminology will suggest itself in a 


mome 


nt. 
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A linear combination of the vectors |o), |6), |v), ..., is an expression of 
the form 
ala) + 548) +cly)+---. [A.11] 


A vector |à) is said to be linearly independent of the set |a), |B), |v), .... 
if it cannot be written as a linear combination of them. (For example, in three 
dimensions the unit vector & is linearly independent of ? and j, but any vector 
in the xy plane is linearly dependent on i and j.) By extension, a set of vectors 
is "linearly independent" if each one is linearly independent of all the rest. A 
collection of vectors is said to span the space if every vector can be written as 
a linear combination of the members of this set.” A set of linearly independent 
vectors that spans the space is called a basis. The number of vectors in any basis 
is called the dimension of the space. For the moment we shall assume that the 
dimension (7) is finite. 
With respect to a prescribed basis 


lei). le2).... . len). [A.12] 


any given vector 
la) = ailei) t alez) + +--+ an len), [A.13] 


is uniquely represented by the (ordered) n-tuple of its components: 
la) <> (a1,a2. ... . an). [A.14] 


“= 


It is often easier to work with the components than with the abstract vectors them- 
selves. To add vectors, you add their corresponding components: 


lv) + |B) < (a1 +b), a2 o bo... La + by: [A.15] 
to multiply by a scalar you multiply each component: 
cla) «€» (caj. cas, ... . Cay): [A.16] 
the null vector is represented by a string of zeroes: 
10) <> (0,0.... , 0); [A.17] 
and the components of the inverse vector have their signs reversed: 


| — æ) <> (—a41. —a2.... . —@)- [A.18] 


5A set of vectors that spans the space is also called complete. though I personally reserve that 
word for the infinite-dimensional case. where subtle questions of convergence may arisc. 
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The only disadvantage of working with components is that you have to commit 
yourself to a particular basis, and the same manipulations will look very different 
to someone working in a different basis. 


Problem A.1 Consider the ordinary vectors in 3 dimensions (ay? + a,j + a-k), 
with complex components. 


(a) Does the subset of all vectors with a- = 0 constitute a vector space? If so, 
what is its dimension; if not, why not? 


(b) What about the subset of all vectors whose z component is 1? Hint: Would 
the sum of two such vectors be in the subset? How about the null vector? 


(c) What about the subset of vectors whose components are all equal? 


*Problem A.2 Consider the collection of all polynomials (with complex coeffi- 
cients) of degree less than N in x. 


(a) Does this set constitute a vector space (with the polynomials as “vectors’’)? 
If so, suggest a convenient basis, and give the dimension of the space. If not, 
which of the defining properties does it lack? 


(b) What if we require that the polynomials be even functions? 


(c) What if we require that the leading coefficient (i.e., the number multiplying 
.N-1 : 
X ) be 1? 


(d) What if we require that the polynomials have the value 0 at x — 1? 


(e) What if we require that the polynomials have the value 1 at x — 0? 


Problem A.3 Prove that the components of a vector with respect to a given basis 
are unique. 


A.2 INNER PRODUCTS 


In three dimensions we encounter two kinds of vector products: the dot product and 
the cross product. The latter does not generalize in any natural way to n-dimensional 
vector spaces, but the former does —in this context it is usually called the inner 
product. The inner product of two vectors (|o) and |8)) is a complex number, 
which we write as (o |f). with the following properties: 
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(Bla) = (a|B)*. [A.19] 
(ala) 20, and (ala) =0¢ |o) = l0), [A.20] 
(a|(b|B) + cly)) = b(a|B) + clay). [A.21] 


Apart from the generalization to complex numbers, these axioms simply codify the 
familiar behavior of dot products. A vector space with an inner product is called 
an inner product space. 

Because the inner product of any vector with itself is a nonnegative number 
(Equation A.20), its square root is real —we call this the norm of the vector: 


lall = y (era): [A.22] 


it generalizes the notion of “length.” A unit vector (one whose norm is 1) is said 
to be normalized (the word should really be “normal,” but I guess that sounds too 
anthropomorphic). Two vectors whose inner product is zero are called orthogonal 
(generalizing the notion of "perpendicular"). A collection of mutually orthogonal 
normalized vectors, 

(a;|à ;) = ij. [A.23] 


is called an orthonormal set. It is always possible (see Problem A.4), and almost 
always convenient, to choose an orthonormal basis; in that case the inner product 
of two vectors can be written very neatly in terms of their components: 


(alb) = ajbi 4c a3063 +: + abs. [A.24] 
the norm (squared) becomes 
(elo) = lai + lea? +++ a. [A.25] 
and the components themselves are 
a; = (e;|a). [A.26] 


(These results generalize the familiar formulas a -b = ayby + ayby + azb;, lal? = 
a2 + a? + a2, anda, =7-a, dy = J:a, a; = k - a, for the three-dimensional 
orthonormal basis 7, f, k.) From now on we shall always work in orthonormal 
bases, unless it is explicitly indicated otherwise. 

Another geometrical quantity one might wish to generalize is the angle 
between two vectors. In ordinary vector analysis cos@ = (a-b)/|a||b|. But because 
the inner product is in general a complex number, the analogous formula (in an 
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arbitrary inner product space) does not define a (real) angle 0. Nevertheless, it is 
still true that the absolute value of this quantity is a number no greater than 1, 


Ial)? < (ala) (818). [A.27] 


(This important result is known as the Schwarz inequality; the proof is given in 
Problem A.5.) So you can, if you like, define the angle between |o) and |£) by 
the formula 


SIP Ela) [A.28] 
a |o 


xProblem A.4 Suppose you start out with a basis (|e). |ez). ... . |es)) that is nor 
orthonormal. The Gram-Schmidt procedure is a systematic ritual for generating 
from it an orthonormal basis ([ej). le5). ... . le;)). It goes like this: 


(i) Normalize the first basis vector (divide by its norm): 


lei) 
lej) 2 —. 
lleill 


(ii) Find the projection of the second vector along the first, and subtract it off: 
le) — (e|e2)le}). 


This vector is orthogonal to |e,); normalize it to get |e3). 


(iii) Subtract from |e3) its projections along |e) and |e): 


lex) — (eles) le}) — (e5le)le5). 


This is orthogonal to |e) and |e); normalize it to get |e4). And so on. 


Use the Gram-Schmidt procedure to orthonormalize the 3-space basis |e,) = 
0 DE (00j +k. le) = (OO? (7 + (DE. lez} = (0)? + (28) 7 + (O)k. 


Problem A.5 Prove the Schwarz inequality (Equation A.27). Hint: Let |y) = 
I8) — ((a|B)/{ala)) |x), and use (y|y) > 0. 


Problem A.6 Find the angle (in the sense of Equation A.28) between the vectors 
la) = (1 +i)? + (Dj + G)k and |B) = (4 — DE + (0j + (2 — 20k. 


Problem A.7 Prove the triangle inequality: ||(|o) + |8))|| x lloll + ISl. 
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A.3 MATRICES 


Suppose you take every vector (in 3-space) and multiply it by 17, or you rotate every 
vector by 39° about the z-axis, or you reflect every vector in the xy plane—these 
are all examples of linear transformations. A linear transformation® (f) takes 
each vector in a vector space and “transforms” it into some other vector (|o) — 
|o") = f |a), subject to the condition that the operation be linear: 


T (ala) + b|B)) = a(T |a)) + b(T|)), [A.29] 


for any vectors |o). |B) and any scalars a, b. 
If you know what a particular linear transformation does to a set of basis 
vectors, you can easily figure out what it does to any vector. For suppose that 


Île) = Tiler) + Tale) +--+» + Tut len), 
T |e2) = Tizlei) + Tozer) +--+ Toile). 


Tlen) = Tylei) + Tonlez) +- + Tanlen): 
or, more compactly, 


n 
fle) 2 9 Tjay Q213.....n). [A.30] 


i=] 


If |o) is an arbitrary vector, 


n 
la) = alei) + aler) +- + anler) = Š ajlej), [A.31] 
al 
then 
n n n n n 
Pla) = Y'a; (fle) = 33 aTe) = 35 | X Tuo; | led. 1.32] 
j=l j=l i=l i=l \j=1 
Evidently T takes a vector with components A1. d2.... 4g, into a vector with 
components” 
n 
a; = > Tijaj. [A.33] 
j=! 


ÓIn this chapter "ll use a hat (^) to denote linear transformations: this is not inconsistent with 
my convention in the text (putting hats on operators). for (as we shall see) quantum operators are linear 
transformations. 

"Notice the reversal of indices between Equations A.30 and A.33. This is not a typographical 


error. Another way of putting it (switching / +> j in Equation A.30) is that if the components transform 
with 7;;. the basis vectors transform with T;;. 
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Thus the n? elements 7;; uniquely characterize the linear transformation Î (with 
respect to a given basis), just as the n components a; uniquely characterize the 
vector |o) (with respect to the same basis): 

T e (Ty.Ti2.--- Tas). [A.34] 
If the basis is orthonormal, it follows from Equation A.30 that 


Tij = teilflej). [A.35] 


It is convenient to display these complex numbers in the form of a matrix:? 


Ti T ... Tin 
D Ta ain Ton 

T= . ‘ . [A.36] 
Ti Tn2 see Tun 


The study of linear transformations reduces then to the theory of matrices. The sum 
of two linear transformations (S + T) is defined in the natural way: 


($+ T)|o) = Sla) + T Ja); [A.37] 


this matches the usual rule for adding matrices (you add their corresponding 
elements): 
U=SiTs Ui; = Sij + Tij. [A.38] 


The product of two linear transformations (ST) is the net effect of performing 
them in succession — first T, then S: 


la) =T la); læ”) = Slo’) = S(T|o) = ST Ja). [A.39] 


What matrix U represents the combined transformation U = $T? It's not hard to 
work it out: 


n 


Hi n n n n 
a= D8) Ys (Ema) =F (Esr) a= Donen 
j=! j=! k=1 k=1 \j=l k=1 
Evidently 
n 
U-ST 2 Ux = » Sij Tx. [A.40] 


j=l 


ŠTII use boldface capital letters, sans serif, to denote square matrices. 
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This is the standard rule for matrix multiplication—to find the ikth element of 
the product ST, you look at the ith row of S, and the kth column of T, multi- 
ply corresponding entries, and add. The same prescription allows you to multiply 
rectangular matrices, as long as the number of columns in the first matches the 
number of rows in the second. In particular, if we write the n-tuple of components 
of |a) as an n x | column matrix (or “column vector")? 


"T 
a? 


a=]. |. [A.41] 
An 
the transformation rule (Equation A.33) can be expressed as a matrix product: 
a’ — Ta. [A.42] 


Now some matrix terminology: 


e The transpose of a matrix (which we shall write with a tilde: T) is the same 
set of elements, but with rows and columns interchanged. In particular, the 
transpose of a column matrix is a row matrix: 


a= (ay 03. xx an) ] [A.43] 


For a square matrix taking the transpose amounts to reflecting in the main 
diagonal (upper left to lower right): 


Tu Due Tn 

" T2 Ta ... du 

T= . i : f [A.44] 
Tin Ton tee Tnn 


A (square) matrix is symmetric if it is equal to its transpose; it is antisym- 
metric if this operation reverses the sign: 


symmetric: T =T; antisymmetric : T = —T. [A.45] 


e The (complex) conjugate of a matrix (which we denote, as usual, with an 
asterisk, T*), consists of the complex conjugate of every element: 


d a 
r-|? ? ae spa [A.46] 
T* T* T* at 


nl na 7^7 nn n: 


9I] use boldface lowercase letters. sans serif, for row and column matrices. 
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A matrix is real if all its elements are real, and imaginary if they are all 
imaginary: 
real : T = T; 


imaginary : T* = —T. [A.47] 


e The hermitian conjugate (or adjoint) of a matrix (indicated by a dagger, 
T^) is the transpose conjugate: 


Th T; uum T 
EA LE D 2 Noe T E 
TeT-2|. . J a' —a'—(ap aj ... aj). 
T", T5, PESE hi [A.48] 


A square matrix is hermitian (or self-adjoint) if it is equal to its hermitian 
conjugate; if hermitian conjugation introduces a minus sign, the matrix is 
skew hermitian (or anti-hermitian): 


hermitian: T' = T: skew hermitian : T' = —T. [A.49] 

In this notation the inner product of two vectors (with respect to an orthonor- 
mal basis— Equation A.24), can be written very neatly as a matrix product: 
(|B) = a'b. [A.50] 


Notice that each of the three operations defined in this paragraph, if applied twice, 
returns you to the original matrix. 

Matrix multiplication is not, in general, commutative (ST + TS); the differ- 
ence between the two orderings is called the commutator:!? 


[S. T] = ST - TS. [A.51] 
The transpose of a product is the product of the transposes in reverse order: 
(ST) = TS. [A.52] 
(see Problem A.11). and the same goes for hermitian conjugates: 
(SD! = T'S'. [A.53] 


The unit matrix (representing a linear transformation that carries every vector 
into itself) consists of ones on the main diagonal, and zeroes everywhere else: 


| 0 «9 
OF Al. ater, D 

l= [A.54] 
0 0 l 


I The commutator only makes sense for square matrices. of course: for rectangular matrices the 
two orderings wouldn't even be the same size. 
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In other words, 
li = dj}. [A.55] 


The inverse of a (square) matrix (written T is defined in the obvious way:!! 
T 'T=TT! =l. [A.56] 


A matrix has an inverse if and only if its determinant!? is nonzero; in fact, 


Hea 


T= , 5 
dat’ Id 


where C is the matrix of cofactors (the cofactor of element T;; is (—1)*/ times 
the determinant of the submatrix obtained from T by erasing the ith row and the 
jth column). A matrix that has no inverse is said to be singular. The inverse of a 
product (assuming it exists) is the product of the inverses in reverse order: 
(ST)! =T's"!, [A.58] 
A matrix is unitary if its inverse is equal to its hermitian conjugate: !? 
unitary : U' = U^. [A.59] 
Assuming the basis is orthonormal, the columns of a unitary matrix constitute an 


orthonormal set, and so too do its rows (see Problem A.12). Linear transformations 
represented by unitary matrices preserve inner products, since (Equation A.50) 


(a’|B’) = a'tb’ = (Ua) (Ub) = a'U'Ub = a'b = (alf). [A.60] 


xProblem A.8 Given the following two matrices: 


—] 1 i 2 0 -i 
A-|2 0 3 B-[0O 1 0 
2i c2 2 i 3 2 


ll Note that the left inverse is equal to the right inverse, for if AT = | and TB = |, then 
(multiplying the second on the left by A and invoking the first) we get B = A. 


1?] assume you know how to evaluate determinants. If not. scc M. Boas. Mathematical Methods 
in the Physical Sciences, 2nd ed. (John Wiley. New York, 1983), Section 3.3. 

lin a real vector space (that is, one in which the scalars are real) the hermitian conjugate is 
the same as the transpose. and a unitary matrix is orthogonal: O — O^. For example. rotations in 
ordinary 3-space are represented by orthogonal matrices. 
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compute: (a) A + B, (b) AB, (c) [A,B], (d) A, (e) A*, (f) AŤ, (g) det(B), and 
(h) B^. Check that BB^! = I. Does A have an inverse? 


xProblem A.9 Using the square matrices in Problem A.8, and the column 


matrices 
i 2 
a—|2i|l. b-2l[(1-DnD 
2 0 


find: (a) Aa, (b) a! b, (c) aBb, (d) ab’. 


Problem A.10 By explicit construction of the matrices in question, show that any 
matrix T can be written 


(a) as the sum of a symmetric matrix S and an antisymmetric matrix A; 
(b) as the sum of a real matrix R and an imaginary matrix M; 


(c) as the sum of a hermitian matrix H and a skew-hermitian matrix K. 


xProblem A.11 Prove Equations A.52, A.53, and A.58. Show that the product of 
two unitary matrices is unitary. Under what conditions is the product of two her- 
mitian matrices hermitian? Is the sum of two unitary matrices necessarily unitary? 
Is the sum of two hermitian matrices hermitian? 


Problem A.12 Show that the rows and columns of a unitary matrix constitute 
orthonormal sets. 


Problem A.13 Noting that det(T) = det(T), show that the determinant of a her- 
mitian matrix is real, the determinant of a unitary matrix has modulus 1 (hence the 
name), and the determinant of an orthogonal matrix is either +1 or —1. 


A.4 CHANGING BASES 


The components of a vector depend, of course, on your (arbitrary) choice of basis, 
and so do the elements of the matrix representing a linear transformation. We 
might inquire how these numbers change when we switch to a different basis. 


Section A.4: Changing Bases 447 


The old basis vectors, |e;) are—like all. vectors—linear combinations of the new 
ones, | fj): 


lei) = Sulf) + Sail fo) +--+ Sail fn). 
leo) = Sil fi) + Sal fo) +--+ + Shoal fn). 


len) = Sinl fi) F Sanl f2) re Sun | fn). 
(for some set of complex numbers S;;), or, more compactly, 
n 
lee». Suc dg ede sm. [A.61] 
i=l 


This is itself a linear transformation (compare Equation A.30),!4 and we know 
immediately how the components transform: 


n 
af = V yas. [A.62] 
j=l 


(where the superscript indicates the basis). In matrix form 
al = Sat. [A.63] 


What about the matrix representing a linear transformation 7—how is it 
modified by a change of basis? Well, in the old basis we had (Equation A.42) 


a” = T'a", 
and Equation A.63—multiplying both sides by S^! —entails'5 a^ = S~'a/, so 
a’! = Sa” = S(T’a’) = ST/S laf. 


Evidently 
T/-ST'S |. [A.64] 


In general, two matrices (T; and T?) are said to be similar if T? = ST S~! for 
some (nonsingular) matrix S. What we have just found is that matrices representing 


l#NŅotice, however. the radically different perspective: In this case we're talking about one and 
the same vector, referred to two completely different bases, whereas before we were thinking of a 
completely different vector, referred to the same basis, 


I5Note that S7! certainly exists—if S were singular, the | f;)’s would not span the space, so 
they wouldn't constitute a basis. 
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the same linear transformation, with respect to different bases, are similar. Inciden- 
tally, if the first basis is orthonormal, the second will also be orthonormal if and 
only if the matrix S is unitary (see Problem A.16). Since we always work in 
orthonormal bases, we are interested mainly in unitary similarity transformations. 

While the elements of the matrix representing a given linear transformation 
may look very different in the new basis, two numbers associated with the matrix 


are unchanged: the determinant and the trace. For the determinant of a product is 


the product of the determinants, and hence 
det(T/) = det(ST/S^!) = det(S) det(T’) det(S~') = det T’. [A.65] 


And the trace, which is the sum of the diagonal elements, 


Tr(T) = > I [A.66] 


i=l 
has the property (see Problem A.17) that 
Tr(Ti T2) 2 Tr(T?T |). [A.67] 
(for any two matrices T, and T2), so 


Tr(T/^) = T(ST/S !) = Tr(T,S ! S) = Tr(T^. [A.68] 


Problem A.14 Using the standard basis (f. 7. k) for vectors in three dimensions: 


(a) Construct the matrix representing a rotation through angle @ (counterclock- 
wise, looking down the axis toward the origin) about the z-axis. 


(b) Construct the matrix representing a rotation by 120° (counterclockwise, look- 
ing down the axis) about an axis through the point (1,1.1). 


(c) Construct the matrix representing reflection through the x y-plane. 


(d) Check that all these matrices are orthogonal, and calculate their determinants. 


Problem A.15 In the usual basis (f, f, k), construct the matrix T, representing a 
rotation through angle 0 about the x-axis, and the matrix T, representing a rotation 
through angle 0 about the y-axis. Suppose now we change bases, to?’ = J, J = 
—i, &' = k. Construct the matrix S that effects this change of basis, and check 
that ST,S ^! and ST,S7! are what you would expect. 


Problem A.16 Show that similarity preserves matrix multiplication (that is, if 
A*B' = C*, then A/ B/ = C). Similarity does not, in general, preserve symmetry, 
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reality, or hermiticity; show, however, that if S is unitary, and H° is hermitian, then 
H” is hermitian. Show that S carries an orthonormal basis into another orthonormal 
basis if and only if it is unitary. 


xProblem A.17 Prove that Tr(T; T;) = Tr(T2T)). It follows immediately that 
Tr(T| T2T3) = Tr(T;T3;T,), but is it the case that Tr(T; T2T3) = Tr(T2T,T3), in 
general? Prove it, or disprove it. Hint: The best disproof is always a counterexam- 
ple—the simpler the better! 


A.S EIGENVECTORS AND EIGENVALUES 


Consider the linear transformation in three-space consisting of a rotation, about 
some specified axis, by an angle 0. Most vectors will change in a rather complicated 
way (they ride around on a cone about the axis), but vectors that happen to lie along 
the axis have very simple behavior: They don't change at all (f |a) = |a)). If 6 is 
180°, then vectors which lie in the “equatorial” plane reverse signs ( T|a) = —|a)). 
In a complex vector space!® every linear transformation has "special" vectors like 
these, which are transformed into scalar multiples of themselves: 


f |a) = Ala); [A.69] 


they are called eigenvectors of the transformation, and the (complex) number A 
is their eigenvalue. (The null vector doesn't count, even though in a trivial sense 
it obeys Equation A.69 for any T and any A; technically, an eigenvector is any 
nonzero vector satisfying Equation A.69.) Notice that any (nonzero) multiple of an 
eigenvector is still an eigenvector, with the same eigenvalue. 

With respect to a particular basis, the eigenvector equation assumes the 
matrix form 


Ta = ìa. [A.70] 


(for nonzero a), or 


(T - ADa — 0. [A.71] 


(Here 0 is the zero matrix, whose elements are all zero.) Now, if the matrix (T — AI 
had an inverse, we could multiply both sides of Equation A.71 by (T — AD), and 


l6This is not always true in a real vector space (where the scalars are restricted to real values). 
See Problem A.18. 
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conclude that a = 0. But by assumption a is not zero, so the matrix (T — Al) must 
in fact be singular, which means that its determinant is zero: 


(Ti; — A) Ti? ae Tin 
Tr, (199 =A) ... Ton 
det(T — Al) = . : = 0. [A.72] 
T, Ti» es (Tin — A) 
Expansion of the determinant yields an algebraic equation for A: 
Cyd” + Cy a"! +-+ Cià + Co = 0. [A.73] 


where the coefficients C; depend on the elements of T (see Problem A.20). This is 
called the characteristic equation for the matrix; its solutions determine the eigen- 
values. Notice that it’s an nth-order equation, so (by the fundamental theorem 
of algebra) it has n (complex) roots.!? However, some of these may be multiple 
roots, so all we can say for certain is that an n x n matrix has at least one and 
at most n distinct eigenvalues. The collection of all the eigenvalues of a matrix is 
called its spectrum; if two (or more) linearly independent eigenvectors share the 
same eigenvalue, the spectrum is said to be degenerate. 

To construct the eigenvectors it is generally easiest simply to plug each A 
back into Equation A.70 and solve “by hand” for the components of a. I'll show 
you how it goes by working out an example. 


Example A.1 Find the eigenvalues and eigenvectors of the following matrix: 


2 0 -2 
M-|-3i i 2i |. [A.74] 
1 0 -1 


Solution: The characteristic equation is 


(2—2) 0 2 
9). Gea) w. | =-494+ (147-1 —0. [A.75] 
l @ (Gr) 


and its roots are 0, 1, and i. Call the components of the first eigenvector (a, a2, a3); 
then 


2 0 -2 a, ay 0 
—2i i 2i a3|—0[a|210 
l 0 -I1 a3 a3 0 


7d is here that the case of real vector spaces becomes more awkward, because the characteristic 
equation need not have any (real) solutions at all. See Problem A.18. 
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which yields three equations: 
2a\ = 2a3 =). 
—2ia| + ia» + 2ia3 =0, 
a, —a3 =Q. 
The first determines a3 (in terms of aj): a3 = a1; the second determines a»: a2 = 0; 


and the third is redundant. We may as well pick a, = 1 (since any multiple of an 
eigenvector is still an eigenvector): 


1 
a -[o0]|, fora, =0. [A.76] 
1 


For the second eigenvector (recycling the same notation for the components) we 
have 


2 0-2 al ay aj 
—2i i 2i a|—1|as| [la 
l 0 -1 a3 aa a3 


which leads to the equations 


2a, — 2a3 =a), 
—2ia, +i + 2id3 = an. 


aj — a3 = 3. 


with the solution a3 = (1/2)a), a» = [(1 —7)/2]aq; this time I'll pick a; = 2, so 


2 
a?-[i1-i|. ford, =1. [A.77] 
l 
Finally, for the third eigenvector, 
2 0 -2Y fa; a| ia 
—2i i 2i an) =iļla |= [ie 
1 0 -1 a3 a3 1a3 


which gives the equations 
2a, — 2a3 = ia], 
—2ia, 4- ia» + 2ia3 = i. 


ay — a3 = ida. 
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whose solution is a3 = aj = 0, with a» undetermined. Choosing a» = 1, we 


conclude 
0 
a?-—]|1], for23- i. [A.78] 
0 


If the eigenvectors span the space (as they do in the preceding example), we 
are free to use them as a basis: 


T|fi) = Afi). 
T\ fo) = Asl). 


T\fn) = As fa). 


In this basis the matrix representing 7 takes on a very simple form, with the 
eigenvalues strung out along the main diagonal, and all other elements zero: 


Aj (V... 0 
0 Az... O0 
ES pe ; [A.79] 
] 0 0 ace An 
and the (normalized) eigenvectors are 
1 0 0 
0 l 0 
oi. Web scs. Fa [A.80] 
0 0 l 


A matrix that can be brought to. diagonal form (Equation A.79) by a change 
of basis is said to be diagonalizable (evidently a matrix is diagonalizable if and 
only if its eigenvectors span the space). The similarity matrix that effects the 
diagonalization can be constructed by using the normalized eigenvectors (in the 
old basis) as the columns of S7!: 


(S 5; = (a);. [A.81] 


Example A.2 In Example A.1, 
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so (using Equation A.57) 


-1 0 2 
S= 1 0 —] 
(—1) 1 (1-1) 


and you can check for yourself that 


l 0 0 
Sa) = |0|, Sa2=[1]. Sa = f0 


0 0 l 
and 
0.0 0 
SMS !-[|0 1 0 
00 i 


There's an obvious advantage in bringing a matrix to diagonal form: It's much 
easier to work with. Unfortunately, not every matrix can be diagonalized—the 
eigenvectors have to span the space. If the characteristic equation has n distinct 
roots, then the matrix is certainly diagonalizable, but it may be diagonalizable even 
if there are multiple roots. (For an example of a matrix that cannot be diagonalized, 
see Problem A.19.) It would be handy to know in advance (before working out 
all the eigenvectors) whether a given matrix is diagonalizable. A useful sufficient 
(though not necessary) condition is the following: A matrix is said to be normal 
if it commutes with its hermitian conjugate: 


normal: [N*. N] = 0. [A.82] 


Every normal matrix is diagonalizable (its eigenvectors span the space). In partic- 
ular, every hermitian matrix, and every unitary matrix, is diagonalizable. 

Suppose we have two diagonalizable matrices; in quantum applications the 
question often arises: Can they be simultaneously diagonalized (by the same 
similarity matrix S)? That is to say, does there exist a basis in which they are 
both diagonal? The answer is yes if and only if the two matrices commute (see 
Problem A.22). 


xProblem A.18 The 2 x 2 matrix representing a rotation of the xy plane is 


cosÓÜ  —sinO 
c= E cos Ó ) [A.83] 


Show that (except for certain special angles— what are they?) this matrix has no 
real eigenvalues. (This reflects the geometrical fact that no vector in the plane 
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is carried into itself under such a rotation; contrast rotations in three dimensions.) 
This matrix does, however, have complex eigenvalues and eigenvectors. Find them. 
Construct a matrix S that diagonalizes T. Perform the similarity transformation 
(STS!) explicitly, and show that it reduces T to diagonal form. 


Problem A.19 Find the eigenvalues and eigenvectors of the following matrix: 


Can this matrix be diagonalized? 


Problem A.20 Show that the first, second, and last coefficients in the characteristic 
equation (Equation A.73) are: 


C,-(-1)", C, 2(-ID"-'T(T). and Co = det(T). [A.84] 


For a 3 x 3 matrix with elements 7;;, what is C1? 


Problem A.21 It's obvious that the trace of a diagonal matrix is the sum of its 
eigenvalues, and its determinant is their product (just look at Equation A.79). It 
follows (from Equations A.65 and A.68) that the same holds for any diagonalizable 
matrix. Prove that in fact 


det(T) = AjAa-+-An. Tr(T)2A414 A22 9 aA. [A.85] 


for any matrix. (The A's are the n solutions to the characteristic equation—in the 
case of multiple roots, there may be fewer linearly independent eigenvectors than 
there are solutions, but we still count each à as many times as it occurs.) Hint: 
Write the characteristic equation in the form 


(Ay — A)(A2 — 3) (4 — A) = 0. 


and use the result of Problem A.20. 


Problem A.22 


(a) Show that if two matrices commute in one basis, then they commute in any 
basis. That is: 


[T T£] 2 0 => [T/.T/] - o. [A-86] 


Hint: Use Equation A.64. 
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(b) Show that if two matrices are simultaneously diagonalizable, they 
commute. !8 


Problem A.23 Consider the matrix 


(a) Is it normal? 


(b) Is it diagonalizable? 


A.6 HERMITIAN TRANSFORMATIONS 


In Equation A.48 I defined the hermitian conjugate (or "adjoint") of a matrix as 
its transpose-conjugate: T? = T'. Now I want to give you a more fundamental 
definition for the hermitian conjugate of a /inear transformation: It is that trans- 
formation T* which, when applied to the first member of an inner product, gives 
the same result as if T itself had been applied to the second vector: 


(f*o|8) = (e|T 8). [A.87] 


(for all vectors |o) and |8)).!? I have to warn you that although everybody uses it, 
this is lousy notation. For a and f are not vectors (the vectors are |o) and |B)), 
they are names. In particular, they are endowed with no mathematical properties 
at all, and the expression “TB” is literally nonsense: Linear transformations act on 
vectors, not labels. But it’s pretty clear what the notation means: TB is the name 
of the vector T|8), and (T*a|B) is the inner product of the vector Î* |æ} with the 
vector |£). Notice in particular that 


(alep) = c(a|B). [A.88] 


whereas 
(ca|B) = c" (æl). [A.89] 


for any scalar c. 


I8 Proving the converse (that if two diagonalizable matrices commute then they are simultaneously 
diagonalizable) is not so simple. Sec for example Eugen Merzbacher. Quantum Mechanics. 3rd. cd.. 
Wiley. New York (1998), Scction 10.4. 

19 You may wonder whether such a transformation necessarily exists. Good question! The answer 


is "yes." See, for instance. P. R. Halmos, Finite Dimensional Vector Spaces, 2nd ed.. van Nostrand. 
Princeton (1958), Section 44. 


456 


Appendix Linear Algebra 


If you’re working in an orthonormal basis (as we always shall), the hermitian 
conjugate of a linear transformation is represented by the hermitian conjugate of 
the corresponding matrix; for (using Equations A.50 and A.53), 


(a|7B) = a' Tb = (T'ay'b = (T'a). [A.90] 


So the terminology is consistent, and we can speak interchangeably in the language 
of transformations or of matrices. 

In quantum | mechanics, a fundamental role is played by hermitian transfor- 
mations (Tt = = T). The eigenvectors and eigenvalues of a hermitian transformation 
have three crucial properties: 

1. The eigenvalues of a hermitian transformation are real. 


Proof: Let A be an eigenvalue of T: T |o) = A|a), with |o) Æ |0). Then 
(a|Ta) = (a|Aa) = X(ala). 
Meanwhile, if 7 is hermitian, then 
(a|Ta) = (Tala) = (Ala) = A* (ala). 


But (o|o) zz 0 (Equation A.20), so A = A*, and hence A is real. QED 


2. The eigenvectors of a hermitian transformation belonging to distinct 
eigenvalues are orthogonal. 


Proof: Suppose 7|o) = Ala) and TB) = ul), with A Æ u. Then 
(alf) = (aluß) = uto). 
and if 7 is hermitian, 
(«|78) = (To|B) = (o18) = à* (a18). 


But A = A* (from 1), and A Æ u, by assumption, so (a|B) =0. QED 


3. The eigenvectors of a hermitian transformation span the space. 
As we have seen, this is equivalent to the statement that any hermitian matrix 
can be diagonalized (see Equation A.82). This rather technical fact is, in a 
sense, the mathematical support on which much of quantum mechanics leans. 
It turns out to be a thinner reed than one might have hoped, because the proof 
does not carry over to infinite-dimensional vector spaces. 
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Problem A.24 A hermitian linear transformation must satisfy («|7B) = (Te|B) 
for all vectors |o) and |8). Prove that it is (surprisingly) sufficient that (y|Ty) = 
(Tyly) for all vectors |y). Hint: First let |v) = |o) + |), and then let |y) = 
lo) + i|B). 


xProblem A.25 Let 


l l-i 
Tea 0) 


(a) Verify that T is hermitian. 
(b) Find its eigenvalues (note that they are real). 
(c) Find and normalize the eigenvectors (note that they are orthogonal). 


(d) Construct the unitary diagonalizing matrix S, and check explicitly that it 
diagonalizes T. 


(e) Check that det(T) and Tr(T) are the same for T as they are for its diagonal- 
ized form. 


x Problem A.26 Consider the following hermitian matrix: 


2 i i 
T2|-i 2 i 
l -i 2 


(a) Calculate det(T) and Tr(T). 


(b) Find the eigenvalues of T. Check that their sum and product are consis- 
tent with (a), in the sense of Equation A.85. Write down the diagonalized 
version of T. 


(c) Find the eigenvectors of T. Within the degenerate sector, construct two lin- 
early independent eigenvectors (it is this step that is always possible for a 
hermitian matrix, but not for an arbitrary matrix —contrast Problem A.19). 
Orthogonalize them, and check that both are orthogonal to the third. 
Normalize all three eigenvectors. 


(d) Construct the unitary matrix S that diagonalizes T, and show explicitly that 
the similarity transformation using S reduces T to the appropriate diagonal 
form. 


Problem A.27 A unitary transformation is one for which UU =1. 


(a) Show that unitary transformations preserve inner products, in the sense that 
(Ua |Up) = (a|B), for all vectors |o), |8). 
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(b) Show that the eigenvalues of a unitary transformation have modulus 1. 


(c) Show that the eigenvectors of a unitary transformation belonging to distinct 
eigenvalues are orthogonal. 


x x «Problem A.28 Functions of matrices are defined by their Taylor series expansions; 
for example, 


l l 
e ZIEMEIM SIM +. [A.91] 
(a) Find exp(M), if 
0 1 3 
()M-2|0 0 4]; à) M (*, oP 
00 0 


(b) Show that if M is diagonalizable, then 


det (e^) = TM) [A.92] 


Comment: This is actually true even if M is nor diagonalizable, but it's 
harder to prove in the general case. 


(c) Show that if the matrices M and N commute, then 


Prove (with the simplest counterexample you can think up) that Equation A.93 
is not true, in general, for non-commuting matrices. 


(d) If H is hermitian, show that e! is unitary. 
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Hund's rules, 216, 218 
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binding energy. 151 
fine structure, 266—276 
ground state, 150—151 
hyperfine structure, 283—285 
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radial wave function, 146—157 
radius, 149—151 
spectrum, 157—160 
Stark effect, 289—290, 297—298 
wave functions, 152, 154 
Zeeman effect, 277—283 
hydrogenic atom, 158—159 
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hydrogen molecule ion, 304—309 
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perturbation theory. 249—255 
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WKB approximation, 317—320 
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Kronecker delta, 33-34 
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vectors. 435—438 
linear combination. 27—28, 437 
linear combination of atomic orbitals, 305 
linear independence. 437 
linear operator. 96 
linear transformation, 93. 96-97. 441—444 
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Maxwell's equations. 385 
mean value, 7-9 


measurement, 2-5. 36-37 


cat paradox. 430—431 
destructive. 428—430 


generalized statistical interpretation. 106—109 


indeterminacy, 98 
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operator, 17—18, 42, 97—98, 103, 132 
transfer, 414 


momentum space wave function, 108—109, 191—192 


most probable configuration, 7. 236-239 
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operator, 17 
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lowering, 44—46, 162. 168—169 
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parahelium, 213 
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Paschen-Back effect, 279 
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Pauli exclusion principle, 204, 214, 218-221, 228 
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time-dependent, 340—367 
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geometric, 379 
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phase shift, 405-408 
phase velocity. 64 
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Planck formula, 158, 243—244, 285 
Planck's blackbody formula, 244—245, 355-356 
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operator, 17 
position-momentum uncertainty principle. 111—113 
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Kronig-Penney, 227 
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potential well 
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power series method, 42. 52. 147—149 
principal (p). 215 
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density. 10-11 Rodrigues formula, 57. 136-137 
discrete variables. 5—9 rotating wave approximation. 348 
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selection rules. 359—363 
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time-dependent perturbation theory, 340—367 
Golden Rule. 354 
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